Gene Franean1_1283 details


Gene Information

Locus tag: Franean1_1283
Symbol:
ID: 5669696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1546753
End bp: 1550016
Gene Length: 3264 bp
Protein Length: 1087 aa
Translation table: 11
GC content: 75%
IMG OID: 641240215
Product: ABC transporter related
Protein accession: YP_001505643
Protein GI: 158313135
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component
TIGRFAM ID:
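
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1546753
end_bp = 1550016

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 3264 0 1087
```

Both results agree with the listed Gene Length (3264 bp) and Protein Length (1087 aa).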


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.887426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.865283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGACG TCATGGGCTA CGCCCTGTTG AGCCTGGGTG CCGGCGCGCT GTACGCGCTC 
GTCGCCACCG GCGTGGTCGT GATCCAGCGC GGTGCCGGCG TGTTGAACAT GGCGCAGGGC
GCGCTGCTGG CGTGGGCCGC GTACGCGTTC CACGGGGCGC GCGACGAGTG GGGTCTGCCC
GCGGCGCCGG CGGCGGTGCT CGCCGTCGGG TCGACGATGG CGGTCGGCCT GGTGTTCCAC
CAGCTGGTAC TCCGCCCGAT GCGCCAGGCC ACGGCCGTGC TGCGCCTGAT GGCCACGCTC
GGCCTGCTGA TCGTCATCCA GTCGCTGCTG ACCCTGATGT ACGGCGGGGA TGTCCGGCGT
TCGCCCACCG TGCTGCCGAC CGGCAAGGTG ACCCTGTTCG ACACCGCGGT CGGGGTGGAC
GTCCTGGTGC GGCTCGCCGT GGTGGCTGCC GTCGTGGCGG GGCTGTGGGC GGTGTTCCGG
TTCACCACGC TCGGGCTCGC GGCGACCGCG GTGACGGAGA ATCCGCAGGC CGCGGCGTCG
CTGGGCTGGT CGCCGGACGC GGTGGCGAGC TGGGCGTGGC TGGCCGGGTC GGCGCTGGCG
GGGCTGGGCG GTGTGCTGCT CGCGCCGCTG CAGAATCCGT TGTCGGCCGG TGGTCTGATG
CTGCTGATCG TCCCGGGGCT GGCGGTGGCG CTGGTCGCCC GGTTCCGGTC GCTGCCGGCG
GTGCTGGTGG GGGCGCTGGC GCTGGGCATC GTGGAGCTGG AGACCCAGGT CTACCTGGTC
GCGGACCATC CCGCCTGGCG GGGTGTCGAC CGTGCCGTCC CGCTCGCGGT GATCGTGTTC
TACCTGGCGG TGCGCGGGCG CGGCATTCCC GACCGGGGCC ATGTCGCCGA GCGCCATCCG
GTGCTGGGTA GCGGCCGGGT GCACTGGCGG GCACTGGGGA TCACGGTCGG CTGCGCGCTT
GTCGCCGTGT GGTGGCTGCT GCCTGCGTCC TGGGTGAACG CGCTGAACGC GAACGCCATC
TGGGCGACGA TCATCCTGTC AGTCGTGGTG CTGGTCGGCT TCACCGGGCA GCTCTCGCTG
GCGCAGCTCG CCTTCTCCGG GATCGCGGCA CTGATCGCCG GACGGCTGGT CGCCACCCGG
GGCTGGCCGC TGGAGGCTGC CCTCGTGGTG GGCGTCGCCG GGACGGCGGT CGTCGGCGTG
CTGTTCGCGC TGCCGGCGCT GCGCACCCGG GGCCTGCAAC TGGCCGTCGT GACGCTCGGC
CTCGGTGCTG CCGTGGACGC GCTGCTGTTC CAGCGCGGTT ACCACTCGCC CGCGGCAAGC
CCGCTCGGTG CCCTCTTCGG CGATCTCGGC ACGCTCGAGG GGACGGAGGT CGGCGACGCG
ACCCTGTTCG GGATCAGTCT GGACAAGGTG ACGCACCCGC GCGGGTTCGC GACCCTGTCG
CTGCTCACGT TCGTCCTGCT CGGGCTGGCG GTGGCGAACC TGCGGCGGGG GCGCGCGGGA
CGGCGGCTGA TCGCCGTCCG GACGAACGAG CGGGCCGCGG CGGCGCTGGG CATCAGCGTG
GTCGGGGCGA AGCTTTACGC CTTCGCGCTG TCCGCCGCGA TCGCCGGCTT CGGGGGAGTG
CTGTACGCGT TCTACACCTA CGGCGAACGC GGCAACATCG ACTACGGCGG CGGCCTGTTC
TCGCCGTTCG CCTCGATCCT GCTGATCGCC TACGCGGTCG TCGGGGGAGT CGGCTGGATC
AGCGGTTCCT TCGCCGGCGC GACGATGGCC GCCGAGGCGC TGGCGACGAG AGCCGGGGCC
TGGGTCGGCA GCGTCCTCGG CCAGCTCGGC CTGCTGCTGC GCCTGCTCTT CGCCGCGGCC
GCCGGCCTGC TGGGCCTCGC GGTCGGCCGC GCCGTCGTGC CAGGTACGGG GCCGACAGTG
CCGGCCGCAG GCGGGACGGC GCCGGGCGCG GTGCGGGCCG CGCACCTGCG GCGGGCGGCG
CCGTGGGTGG TGACCGCGGC CTTCGCCGGA GCCGCCTTCG CGGGCGGGGG CAGGGTCGTC
GACTGGCTGG CCGACCTTGA CCGCTACGTC CCGCTGATCG GCGGCCTGGT GCTGGTCACG
GTCCTCTCCC GGTCCGGTGG TGGGATGGCG CCGGAGAACG CCCGTACCGC CCGCCGGATC
CTGGAGCGCC GCTTCCCGAA GGCCGTCGAG CGGCGGATGG CCGCCGATGC GGCCCGGGTC
GCCCGCCTGC TCGGCCCCGC TCCGACCCTC GGCCCCACTC CGACCCTCGA ACCAGCTCTC
GCCACCCCGA ACGCCACCAA TGCCTCCGCT CCCGCCGCAG CGGCGGGCCC GGCGACGGGA
CCGATCCGCC GGGTCGGCAC GGTCGCCTAC CCGGCCCGGC CGGCCACGCT GTCGGTCTGG
GGACTGTCGG TGAGCTTCGG GCCGGTGGCT GCGGTCCGCT CGGTCGACCT GCGGGTCGAG
CCGGGACGGG TCGTCGGCGT GATCGGCCCG AACGGTGCGG GCAAGACGAC GGTGATCGAC
GCGATCACCG GCTACACCTC GTCCGTCTCG CGGTCCCTGA TGCTCGGCGA CACCCGGCTC
GACCGGCTGC CGGCCCACCT GCGTGCCCGT GCCGGGATCA GCCGGTCGTT CCAGAACCTC
GAGCTTTTCG AGGATCTCAC GGTCATCGAG AACATCCAGG CCGCCTGCGA CCCGCGGGAC
GCCCGTGCCT ACGCCGGTGA CCTGGTGCTG CCGCGCTCCC GGCCGCTGCC CGCCGCGGCC
GCCGCCGCCG TGGAGGCCTT CGGCCTGCGC GAGGACCTGC TCCGCACGGT CTCTGAGCTC
TCCTACGGCC GCCGCCGGCT GCTGGCGATC GCCCGCGCGG TGGCGACCTG CCCGTCCGTG
CTGCTGCTGG ACGAGCCGTG CGCGGGGCTC GACGAGAACG AGAGCGCCGG GGTGGCGACG
CTGCTGCGGC GGCTGGCGGA CAACTGGGGG CTGGGAATCC TGCTCAACGA GCACGACATG
GACGTGGTGA TGCGGATCTG CGACCAGGTG GTCGTTCTCG ACGGCGGTGA GGTGATCGCG
CAGGGCACCC CGGGCCAGGT ACGGGTCGAC CCGCGGGTGC GCCGGGCCTA CCTGGGCGCG
GCGACCGGCA CCGGCCGCAC ACCGGCGGCA GGCACGCCCG GCTCGGCGGG TGCGGCAGCC
GGCGCGACAG GCACGGCCAG CTCGCCGGGA ACAGCGGGTA CGGCCGCCAG CGCCACGCGA
CCGGCTGGCC GGGGCCGGCG GTGA
 
Protein sequence
MDDVMGYALL SLGAGALYAL VATGVVVIQR GAGVLNMAQG ALLAWAAYAF HGARDEWGLP 
AAPAAVLAVG STMAVGLVFH QLVLRPMRQA TAVLRLMATL GLLIVIQSLL TLMYGGDVRR
SPTVLPTGKV TLFDTAVGVD VLVRLAVVAA VVAGLWAVFR FTTLGLAATA VTENPQAAAS
LGWSPDAVAS WAWLAGSALA GLGGVLLAPL QNPLSAGGLM LLIVPGLAVA LVARFRSLPA
VLVGALALGI VELETQVYLV ADHPAWRGVD RAVPLAVIVF YLAVRGRGIP DRGHVAERHP
VLGSGRVHWR ALGITVGCAL VAVWWLLPAS WVNALNANAI WATIILSVVV LVGFTGQLSL
AQLAFSGIAA LIAGRLVATR GWPLEAALVV GVAGTAVVGV LFALPALRTR GLQLAVVTLG
LGAAVDALLF QRGYHSPAAS PLGALFGDLG TLEGTEVGDA TLFGISLDKV THPRGFATLS
LLTFVLLGLA VANLRRGRAG RRLIAVRTNE RAAAALGISV VGAKLYAFAL SAAIAGFGGV
LYAFYTYGER GNIDYGGGLF SPFASILLIA YAVVGGVGWI SGSFAGATMA AEALATRAGA
WVGSVLGQLG LLLRLLFAAA AGLLGLAVGR AVVPGTGPTV PAAGGTAPGA VRAAHLRRAA
PWVVTAAFAG AAFAGGGRVV DWLADLDRYV PLIGGLVLVT VLSRSGGGMA PENARTARRI
LERRFPKAVE RRMAADAARV ARLLGPAPTL GPTPTLEPAL ATPNATNASA PAAAAGPATG
PIRRVGTVAY PARPATLSVW GLSVSFGPVA AVRSVDLRVE PGRVVGVIGP NGAGKTTVID
AITGYTSSVS RSLMLGDTRL DRLPAHLRAR AGISRSFQNL ELFEDLTVIE NIQAACDPRD
ARAYAGDLVL PRSRPLPAAA AAAVEAFGLR EDLLRTVSEL SYGRRRLLAI ARAVATCPSV
LLLDEPCAGL DENESAGVAT LLRRLADNWG LGILLNEHDM DVVMRICDQV VVLDGGEVIA
QGTPGQVRVD PRVRRAYLGA ATGTGRTPAA GTPGSAGAAA GATGTASSPG TAGTAASATR
PAGRGRR
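
The gene and protein sequences above are printed in blocks of 10 characters, so they need whitespace removed before use. The following is a minimal sketch of how the nucleotide blocks could be cleaned and translated with translation table 11 to confirm that they match the protein sequence. The helper names (clean, translate) are hypothetical. The sketch assumes that table 11 uses the same codon-to-amino-acid assignments as the standard code and that an alternative start codon such as GTG is read as M when it opens the CDS.

```python
# Codon table built in NCBI order (bases T, C, A, G); assumed to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for table 11; any of these is read as M at the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str, has_start: bool = True) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGGACGACG TCATGGGCTA CGCCCTGTTG AGCCTGGGTG CCGGCGCGCT GTACGCGCTC"
last_line = "CCGGCTGGCC GGGGCCGGCG GTGA"

print(translate(clean(first_line)))                   # MDDVMGYALLSLGAGALYAL
print(translate(clean(last_line), has_start=False))   # PAGRGRR
```

The first line translates to the opening 20 residues of the protein sequence, with the GTG start codon read as M, and the last line ends in the TGA stop codon after the final residues PAGRGRR. The last line can be translated on its own because the 54 preceding lines hold 60 nucleotides each, which keeps it in frame.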