Gene Franean1_2025 details


Gene Information

Locus tag: Franean1_2025
Symbol:
ID: 5670426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2433825
End bp: 2435093
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 71%
IMG OID: 641240946
Product: hypothetical protein
Protein accession: YP_001506368
Protein GI: 158313860
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
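
The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2433825
end_bp = 2435093
gene_length_bp = 1269
protein_length_aa = 422


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2435093 - 2433825 + 1 = 1269 bp, and 1269 / 3 - 1 = 422 aa, matching the listed values.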


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0203629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.690247
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGGTAT TTCGTACGAG GTTGCTCGCC GCGGCGCTCG TGACGGCCGC CCTGGCGGCG 
GCCTGCGGCT CCAGCGAGTC CTCCGGCGTG GACGCGGCGG CGTCGCCCTG TGCTCCCGGA
GTCACGGACG ACGAGGTCAA CGTCGGTATT GTCTACCCGG ACACCGGTGT GCTCTCGGCA
CAGTTCACCG GCTACCGGTT CGGCGTCGAG GCCCGGTTCG CCGAGGCGAA CGCCGCCGGT
GGCGTCGACG GCCGGCAGAT CTCGACGATC TGGCGGGACG ACGAGTTCGA CTCCGCCGGC
AACCTGCAGG CGGCCCGCGA GCTTCTCCGC GAGAACGTGT TCGCGGTTCT CGAGTACACC
GCGCATTCCG AACAGTCCAC GCCGCTGTTG CACGACAAGG GAATTCCGGT GGTGGGCGTC
GCGGACCAGG CCGGGTGGGC CGACAACGAC AACATGTTCC CGCTCACGTA TCAGGTCGAC
GACACCGACG CGACCAGCAC TCTCGGAGAT TTCGTCCGGG CCCAGGGCGG CACCCGCGCG
GCCCTCATCA CGACGACGAT CACCGAGTCG TCGGTCATGT ACGCCCAGAA CGCCCGCCGG
AGCCTGGAGG CTGCCGGCAT CCCGGTCGTC TTCGCGGACA CGAACGCCAC GGTGAGCTCA
CCGGAGGCCG TGCAGCGGAT CGTCGGCAGC GGCGCCGACA CGCTGATCTC GCTCGCCTCG
CTCGACCTCT ACTCCGGCGC GGTGACGGCG GCGGCCGCGG CGAACAGGCC GTTCAAGGTG
GCGGTCTCGC CCATCACCTA CGACGCGCAC CTGCTCAACA CGCCGATCGC GCGCGCTCTG
GCCGGCACCT ACTCGACCGT GGGCTTCAGC GCGATCGAGC GGAACCTGCC CGCGCACCGC
GCCTACCTGG CCGCGATGAC CACCTACGCG CCGGAGATCC AGCCGCCGAC GCAGCTGAGC
TCCATCTACG GCTACATCAC CGCGGACCTG TTCCTGCGGG GCCTGCAGGG GCAGCAGGGC
TGCCCCACCC GCGAGTCGTA CATCCGCGGG CTGCGCGGTG TGACCGACTA CGACGGGGGC
GGACTGCTCA ACCAGCCGGT TGACCTGTCC GCCGGCAAGC GCACGGTCGA CCTGTGCACG
GACTTCGTCC GGGTCTCCGC CGCCGGGAAC GCCTTCGAGC CCGTCGGCGC GAAGCCGCTG
TGCGGCCAGG TGATCAGCGG CGCGGCCCAG GGAGGCGCCG GGGCGTCCGT GCCAGCCCGG
GGGCGGTGA
 
Protein sequence
MLVFRTRLLA AALVTAALAA ACGSSESSGV DAAASPCAPG VTDDEVNVGI VYPDTGVLSA 
QFTGYRFGVE ARFAEANAAG GVDGRQISTI WRDDEFDSAG NLQAARELLR ENVFAVLEYT
AHSEQSTPLL HDKGIPVVGV ADQAGWADND NMFPLTYQVD DTDATSTLGD FVRAQGGTRA
ALITTTITES SVMYAQNARR SLEAAGIPVV FADTNATVSS PEAVQRIVGS GADTLISLAS
LDLYSGAVTA AAAANRPFKV AVSPITYDAH LLNTPIARAL AGTYSTVGFS AIERNLPAHR
AYLAAMTTYA PEIQPPTQLS SIYGYITADL FLRGLQGQQG CPTRESYIRG LRGVTDYDGG
GLLNQPVDLS AGKRTVDLCT DFVRVSAAGN AFEPVGAKPL CGQVISGAAQ GGAGASVPAR
GR
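
The gene sequence starts with GTG while the protein starts with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below shows one way to strip the 10-letter grouping from a pasted sequence, compute its GC percentage, and check the start codon. The helper names and the start-codon set (a common subset, not the full table 11 list) are assumptions for illustration.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-letter groups."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# Common bacterial initiation codons; a subset of those allowed by table 11.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def has_common_start(seq: str) -> bool:
    """True if the first codon is one of the common initiation codons."""
    return clean(seq)[:3] in COMMON_START_CODONS


# First line of the gene sequence above, used as a small input.
first_line = "GTGCTGGTAT TTCGTACGAG GTTGCTCGCC GCGGCGCTCG TGACGGCCGC CCTGGCGGCG"
print(len(clean(first_line)))
print(has_common_start(first_line))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the record.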