Gene Franean1_4564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4564 
Symbol 
ID5672911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5445130 
End bp5448099 
Gene Length2970 bp 
Protein Length989 aa 
Translation table11 
GC content71% 
IMG OID641243427 
ProductABC transporter related 
Protein accessionYP_001508843 
Protein GI158316335 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0411] ABC-type branched-chain amino acid transport systems, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAGCA CCGACCTCAT CTTTCTGATC CTCGGCCTCG GTAATGGCGC CGTCTACGCG 
GCGCTCGGGC TCGGCCTGGT CCTCACCTAC CGCAGCTCGG GTGTGGTCAA TTTCGCCACC
GGTGCGGTTG CCCTCTACAC CGCCTACACC TATGCCTTCC TTCGGCAGGG GAAACTGCTC
AACCCGATCC CCGGCTTAAC CGGGACCGTC GATCTCGGGA TCGACGGGAT GGGCTTCCCG
GCGGCGTTCG CACTCTCGCT GGTGGTCGCC GCGGCGCTGG GATCACTGCT GTACGCGGCT
GTCTTCCGTC CGATGCGGGC GGCACCCACC GCCGCGAAGG CCGTCGCGTC GATCGGCGTG
ATGATCGTCA TGCAGGCGCT GCTCGCGGTC CAGGTGGGCA CGACCGCGGT CTCGGTGGCG
GCCATCCTGC CGACGCGCAT CTACACCGTC GCCGGGCAGC GGGTTCCCGG CGACCGGCTC
TGGTTCGCCG GCATCATCAT CGTGATGGCG GTCGCCCTGA CACTGTGGTT CCGGCTCACC
CGGTTCGGCC TGGCCACCCG CGCGGCGGCC GAGTCCGAGA AGGGCGCCCT GGTCACTGGG
CTGTCGCCGC AGCGCATCGC GCTGGTCAAC TGGGCCCTGA GCACCATGAT CGCCGGTGTC
GGCGGCATCC TCATCGCGCC CATCGTTCCG CTCACCCCGG TGTCCTACAC CCTGTTCATC
GTCCCGGCGC TGGCGACGGC GCTCGTCGGG AACTTCACCA GGATCGGTAC GGCGGTCTCG
GCCGGGCTCG TCATCGGCAT GCTGCAGTCG GAGGCCACGA ACCTGCAGAC CAGGAGCTGG
CTGCCGTCGT CCGGCCTCGC CGAGCTCGTC CCGCTCGCCG TCATCCTGAT CTTCCTGGTC
TTCCGGGGGC AGACCCTGCC CTCGCGCGGC TCGATCGTGC AGCAGACCCT CGGCCGGGCG
CCGCGACCGA AGTCGATGGT GCTGCCCGGT GTGGTCATCG CCGCCGCCGG GTTCGCCGCG
CTGGCGGCGA CGCACGGTTC CCACCGGGCA GCGATCATCA CCACGCTCGT CCTGGCGATC
ATCGCGTTGT CCCAGGTGGT CGTCACCGGC TTCAGCGGCC AGATCTCCCT GGCGCAGCTG
ACCCTGGCCG GCGTCGGCGC GTTCGCGCTG ACCCGGATCC AGCACCAGCT GCACGTGCCG
TTCCCGATCG CGCCGCTGCT CGCCGCCGTC TTCGCGACGA TCGTGGGGGT GGTGGTCGGC
CTGCCGGCGC TGCGGATCCG CGGCCTGCCG GTGGCGGTCA CCACGCTCGC GCTGGCGGTG
GTGCTCGAGA AGCTGTGGTT CACCAACAAG GACCTCAACG GCGGGTTCAA CGGCTCGCCG
ATCGACGACG CGAGCATCTT CGGCGTCAAT CTCGGGATCG GCGCCGGCGC CGGCTACCCA
CGCCTCACGT TCGGCCTGTT CTGCCTGGTC GTCCTGCTGC TGGTGGCCGG TGGCGTGGTG
CTGTTGCGCC GCAGCCGGCT GGGGGCGGCG ATGCTGGCGG TACGCGCCAA CGAACGATCG
GCCGCCGCCT CGGGCATCTC CGTGTCGCAG GTGAAGCTCG TCGCCTTCGC CATCGGCGGG
TTCCTCGCCG GGCTGGGCGG CGCCATGCTG GCCTACCAGC AGACCGTGGC GGACTCCAGC
TCGTACACCG CGATGGGCTG CGTCGCGCTG TTCGCCACCG CGTACCTGGC CGGCGTCACG
TCGGTGTCCG GCGGCATCAA CGCCGGGCTC ATCGGCGCGG GCGGCATCAT CTTCACCCTG
GTCGACAAGG GCCTGCCGCT GGGCGTCTAC GTCCTGCTGG TCGGGCTGCC CGTCCTCGCG
ATCGTCGCCG AGATCCGACC GAAGCTGATC CCGGCGCTGT CCGGTGTGGT CGGGGTGCTG
ACCCTGGCGG CCTTCCTCCT CCACCGTGAC ACGGTGGCCC TGGGCGACTA CTACACGACC
GTCAGCGGCG TCCTGTTGGT GCTCACCGTC ATCCTCAACC CGGAGGGCAT CGTCGGGCCG
GTGCACGAAC ATCTGGGCGC GCTGCGCACC AGGCTCGGAC GCCGATCGCC GGCCTCCCTC
GCCCGGCCGG CCGGAAACGC ACCGGCCAGG GACGCGGTCG CGGCAACCAC AGCCTCCGAG
CTGGCGGTAC CCCACGACAC CCACGAGGTC ACGGCGGGCC CGCTGCTGCG GGTCAGCGGC
GTCGGCGTGC GCTACGGCGC CGTCGTCGCG AACCAGGACG TGAGCTTCGA CGTCGACCGC
GGCGAGATCG TCGGCCTCAT CGGCCCGAAC GGCGCCGGCA AGACGACCCT GATCGACGCG
ATAAGCGGCT ATGCCGACGC GACCGGGTCG ATCGAGTTCC TCGGCCGCAG GCTCGACGGG
CTCAAACCCC ATCAGCGGAG TCGGCGCGGC CTCGGCCGGA CCTTCCAGGG CATCGAGCTG
TACGACGATC TCAGCGTCCG GGAGAACGTC CAGACGGGCA CGATGGCGGT CCGCTCGTCG
GGCGACCGCG CCACGCCGTC GCCCGTGGAC ATCGACCGGT TGTTCACCAT CCTGCATCTC
GAGGCGGTGT CCGAAGCCCC GGTGCGGGAG CTGTCCGTCG GGCAGCGTCA GCTGGTCTCC
GTCGCCCGCG CGCTGGCCGG GCGTCCGGCG ATCGTCCTGC TGGACGAGCC GGCCGCCGGG
CTGGACACGA CCGAGAGCCG CTGGCTGGGC GAGCGGCTGC GGGCCATCCG CGACGCCGGC
ATCACCATCG TCATGGTCGA CCACGACATG GGCCTCGTCC TGGACGTGTG CGACCGGATC
GTGGTGCTCA ACCTCGGGGA GGTAATCGTG GTCGGCACAC CGGCAGAGAT CCAGCGCAAC
CCGGAGGTCA CCCGCGCCTA CCTGGGCACG ACGCACGGGC ACGCGGAGCA CGGGCACGCA
GAGCGCGTGC ATGCGGAGGA GGTGCGATGA
 
Protein sequence
MGSTDLIFLI LGLGNGAVYA ALGLGLVLTY RSSGVVNFAT GAVALYTAYT YAFLRQGKLL 
NPIPGLTGTV DLGIDGMGFP AAFALSLVVA AALGSLLYAA VFRPMRAAPT AAKAVASIGV
MIVMQALLAV QVGTTAVSVA AILPTRIYTV AGQRVPGDRL WFAGIIIVMA VALTLWFRLT
RFGLATRAAA ESEKGALVTG LSPQRIALVN WALSTMIAGV GGILIAPIVP LTPVSYTLFI
VPALATALVG NFTRIGTAVS AGLVIGMLQS EATNLQTRSW LPSSGLAELV PLAVILIFLV
FRGQTLPSRG SIVQQTLGRA PRPKSMVLPG VVIAAAGFAA LAATHGSHRA AIITTLVLAI
IALSQVVVTG FSGQISLAQL TLAGVGAFAL TRIQHQLHVP FPIAPLLAAV FATIVGVVVG
LPALRIRGLP VAVTTLALAV VLEKLWFTNK DLNGGFNGSP IDDASIFGVN LGIGAGAGYP
RLTFGLFCLV VLLLVAGGVV LLRRSRLGAA MLAVRANERS AAASGISVSQ VKLVAFAIGG
FLAGLGGAML AYQQTVADSS SYTAMGCVAL FATAYLAGVT SVSGGINAGL IGAGGIIFTL
VDKGLPLGVY VLLVGLPVLA IVAEIRPKLI PALSGVVGVL TLAAFLLHRD TVALGDYYTT
VSGVLLVLTV ILNPEGIVGP VHEHLGALRT RLGRRSPASL ARPAGNAPAR DAVAATTASE
LAVPHDTHEV TAGPLLRVSG VGVRYGAVVA NQDVSFDVDR GEIVGLIGPN GAGKTTLIDA
ISGYADATGS IEFLGRRLDG LKPHQRSRRG LGRTFQGIEL YDDLSVRENV QTGTMAVRSS
GDRATPSPVD IDRLFTILHL EAVSEAPVRE LSVGQRQLVS VARALAGRPA IVLLDEPAAG
LDTTESRWLG ERLRAIRDAG ITIVMVDHDM GLVLDVCDRI VVLNLGEVIV VGTPAEIQRN
PEVTRAYLGT THGHAEHGHA ERVHAEEVR