Gene Franean1_5664 details

Gene Information

Locus tag: Franean1_5664
Symbol:
ID: 5673991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6875780
End bp: 6878005
Gene Length: 2226 bp
Protein Length: 741 aa
Translation table: 11
GC content: 73%
IMG OID: 641244518
Product: inner-membrane translocator
Protein accession: YP_001509921
Protein GI: 158317413
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0559] Branched-chain amino acid ABC-type transport system, permease components
        [COG4177] ABC-type branched-chain amino acid transport system, permease component
TIGRFAM ID:
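
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6875780
end_bp = 6878005

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under those assumptions the computed values agree with the listed Gene Length (2226 bp) and Protein Length (741 aa).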


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.269421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.7047
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGTA TCGATCTTGC TCTCGCCGGC ATCTCCATCG GGGCCATCGC GGCCCTCTCC 
GGCATCGGGC TGCTTGTCAC CTACCGCACC AACGGCGTTC TCAACCTCGC GCAGGGCGGA
ATCGCGACCC TGGTCGCGTA CGTGTTCCGC GAGATGGTCG TCGAGTGGGA CCTGCCGATC
TGGCTGGCCG CGGTGATCGC CCTCGGCATC CTCTCCCCCG GCATCGGGCT TCTACTCGAA
CGGCTCGTGT TCCGCCCGCT CGCCCGTCGA CGCGCCTCCG CCGCCGAGTC GCTGGTCGCG
AGCCTCGGCG TCCTGGTCCT CACGCTCGGC ATGACGGCCG GCATCTGGGG CCTGGGCTCC
CGCAGCGACG CGCCGAGTAT CTTCCCGAAC GAGTCGGTCA CGATCTTCGG TGACACCCGC
ATCGGCGTCG ACGCCCTGGC CGAGCTGGCC ATGGTCGTCG CGTTCTGCAT CGTGCTCGGG
CTCATCGCGG CGAAGACGAA GTTCGGCCGC CAGATCCGGG CCGTCGTCGA CGACCGCCAG
CTCGCCGAGC TCTCCGGCGT GCCCGCCGAC CGGGTCGCCG CGGCCGGGTG GGCGCTGGGC
ACCACGATGG CCGGCCTCAC CGGCATCCTG CTGGCGCCCC GGTTCCAGCT CAACCCGTAC
GGGCTGACGC TGCTCGTCCT GGAGACGTTC GCCGTCGTCG TGGCCGCCCG GCTGTCGAGC
ATGCCAGTGG CGGTGCTGAC CGCGCTGGGC ATCGCCGTCC TGCAGAGCGA GCTCAAGCAG
TTCACCCTCG AGGGCGACGC CGGCCAGATC CTCATCGTTC TGCAGTCCAA CCTGTTCATC
TTCGCGCTGC TGGTGCTGCT GCTCGTGGTA CCCAAGCTGC GCGAGCTGGG CAACGGCGAC
TCCGGCTCGA CCGGGAGCTT CTCCAGCCGG GGCGCTCCCC CCGCCGGCCC GGTCAGCCGG
CACGACGCCC GGCGCGACAT CCTCGGCAAG CTCGGCGGCG CCGCGCTGCT GCTGGCCCCG
CTGACCTTCG CCCCGGCCGA CCTGCGCTCG GCGTTCATGG TGCCCGCCCT GGCCCTGATC
TTCCTCTCGC TGGTGATCCT CACCGGCTAC AGCGGCCAGC TCTCCCTCGG CGTCGCCGGC
TACGCGGGCC TGGGCGCCCT GCTCACCCTC AAGCTCGCCA ACGGCGACCT GTTCGGCATC
CCGGCGATCC CCGGCATGTG GGCGATGTTC CTGGGCGCGC TGCTGGTCGC GCCGATCGGA
CTGATCACCG GGTGGCCGGC GATCCGCCGG CGCGGGCTGA CTCTCGCGCT GACGACCTTC
GCGGTCGGCG CGGTCGTGAG CCGCTTCGTC TTCGAGCAGC CGACCTTCGC GACCGGCCTG
TACGTCGACC CGCTGACGTT CTTCGGCTCC GAACTGACCG ACGAGGCCTT CTACATCTTC
GAGCTGGTCT GCCTGGCCAT CGGCCTGCTG GTTGTGCGCA ACCTGCACCA CGGCCGGCTC
GGCCGGGCCC TGCTCGCCGT CCGCGACCAC ACCGAGGGCG CCGCGGCGGT CGGGGTGGAC
GTCCGCAACC TCAAACTGCT CGCGTTCACC GTCTCCTCGG TCGTGGCCGG CCTCGGCGGC
GGGCTGCTGA CCTTCAGCTC CTCCTCGTTC TCGGCGGACG ACTTCGCGCC GTTGCAGAGC
CTGCTGTGGT TCACCGCCGT GATCGTCTTC GGTGCCGACA GCGCCGCCGG GGCGATCATC
GCGGCGGCGT TCATCGTCAC CATCGACGTG CTGGCCCCGG CCGGCTCGTC GATCCTCGCG
GTCGGCATCC TCGCCCTGGC GCTGGGCTGG ATGCCCGGCG GCCTCGCGTC CGCGGTCCGG
GCCGGCTTCG CGCTGGTCGC CCGCACGCTG GCCGACGAGT TCGTCCCCGC GCCGCGGCCC
ACCCGCCATG CCAGCGCCCG TCTCGCCGCG GGCGGCGCCG CGGGCCACGT GCCCGGTGTC
CCGGGCGCCG GCGGCGGAGC GGCTCTCGGC GAGCTGCGGA TCGTGCTGCC GCCGGGAGCA
CCGGCACCGA CCGCCTTCGG CGCGGCGCTG CTGAGCGCGG TCGCCGCACA CGCCGCCCGC
ACCGGGCTGA GCCTGGACGC GCTGCCGAGC GCCCGCACGG CGGCGGACGA CTCCACAGAC
GGCCGGGAGC TGGTCGGGCA CCGACCGCAC CTGCCCGGAT ACCCGGTCGA GAGAGGCGCG
TCATGA
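
The gene sequence above is printed in space-separated blocks of 10 bases, so the spacing has to be removed before the sequence is used elsewhere. The sketch below shows one way to do that. It uses only the first and last lines of the listing, and the helper name `join_blocks` is hypothetical. It also checks that the listing begins with an ATG start codon and ends with a stop codon; TAA, TAG and TGA are the stop codons in translation table 11.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()

# First and last lines of the gene sequence listing above.
first_line = "ATGACCAGTA TCGATCTTGC TCTCGCCGGC ATCTCCATCG GGGCCATCGC GGCCCTCTCC "
last_line = "TCATGA"

# Stop codons under translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

first_bases = join_blocks(first_line)
start_codon = first_bases[:3]
stop_codon = join_blocks(last_line)[-3:]

print(len(first_bases), start_codon, stop_codon, stop_codon in STOP_CODONS)
```

Passing the whole listing to the same helper should give a single string whose length matches the Gene Length field, assuming the listing is complete.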
 
Protein sequence
MTSIDLALAG ISIGAIAALS GIGLLVTYRT NGVLNLAQGG IATLVAYVFR EMVVEWDLPI 
WLAAVIALGI LSPGIGLLLE RLVFRPLARR RASAAESLVA SLGVLVLTLG MTAGIWGLGS
RSDAPSIFPN ESVTIFGDTR IGVDALAELA MVVAFCIVLG LIAAKTKFGR QIRAVVDDRQ
LAELSGVPAD RVAAAGWALG TTMAGLTGIL LAPRFQLNPY GLTLLVLETF AVVVAARLSS
MPVAVLTALG IAVLQSELKQ FTLEGDAGQI LIVLQSNLFI FALLVLLLVV PKLRELGNGD
SGSTGSFSSR GAPPAGPVSR HDARRDILGK LGGAALLLAP LTFAPADLRS AFMVPALALI
FLSLVILTGY SGQLSLGVAG YAGLGALLTL KLANGDLFGI PAIPGMWAMF LGALLVAPIG
LITGWPAIRR RGLTLALTTF AVGAVVSRFV FEQPTFATGL YVDPLTFFGS ELTDEAFYIF
ELVCLAIGLL VVRNLHHGRL GRALLAVRDH TEGAAAVGVD VRNLKLLAFT VSSVVAGLGG
GLLTFSSSSF SADDFAPLQS LLWFTAVIVF GADSAAGAII AAAFIVTIDV LAPAGSSILA
VGILALALGW MPGGLASAVR AGFALVARTL ADEFVPAPRP TRHASARLAA GGAAGHVPGV
PGAGGGAALG ELRIVLPPGA PAPTAFGAAL LSAVAAHAAR TGLSLDALPS ARTAADDSTD
GRELVGHRPH LPGYPVERGA S