Gene Franean1_7172 details


Gene Information

Locus tag: Franean1_7172
Symbol:
ID: 5675473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8753255
End bp: 8755342
Gene Length: 2088 bp
Protein Length: 695 aa
Translation table: 11
GC content: 71%
IMG OID: 641246009
Product: hypothetical protein
Protein accession: YP_001511397
Protein GI: 158318889
COG category:
COG ID:
TIGRFAM ID:
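
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the record: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 8753255
end_bp = 8755342
reported_gene_length = 2088      # bp
reported_protein_length = 695    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == reported_gene_length)
print("protein length matches:", computed_protein_length == reported_protein_length)
```

Under those assumptions both reported lengths agree with the coordinates: 2088 bp is 696 codons, of which the last is the stop codon.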


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.417277
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTCAAG CGAGGCAGAT TCCGACACGT CAACCGCTCT ATGTCGTCTC CGACGGCTCC 
CGTCGTATGG ATTCCGCCCG GCTGACGGCC GCCAATCTGC TGTTGGGCCA CCTGGTCGAC
GGAGTCACGT CAACACCCGA GCTGGCGGCA GCCGTTCCGC TGTGCGTCCA GGAGTTCGAC
CAGGAATCGA CGACGCTGAT GCGGCTAGCC CCGTTGGATC CGGCCGTGCC GGTCCGCGAG
CTGACGGCCA GCCTCGCGGA GCCATCCCTG ACAGCGGCGT TCACCGGGCT GCGGGCGACA
GCCACCGGTG ACCTCGCCCA GCTGTTGGCC GACGACTTCG AGGTGGGTCA GCCCATCGTG
GTCGTCATCA CCAGCGGCCA CTCCCGCGAC ACCGAGGCCG ACCGGCTGCG GGCCTTCACC
GAACTGGTGA CGACGAACCT GTGGCCGGAC CAGACCGCCG ACTGGGAACC GTCGGTGTAC
CTGTTCGCCC TCGACGCTCC GGACTCGGCG ATGCTCGATC GGTTCGCCAG CTCCGGTGCC
CGTTGCCGGG TCCGCCCGGT GGCCGTCGGC ACCGACCCCG CACCCGATGT CGCCGACGTG
ATCAGGGATC TGGAGCTGCG TGTCCAGGGC GGGGCGCAGG GGATGAAGGT GGCACGCTGG
GTGCTGTCGG TCGCCCAGCT CGGCTCCACC GCGCCGACCG CTCGCGTCGA CGGCGTGGAG
ATTCACGAGA TCCCTGAGCT GACTCTGCCA TGGGTCACGA AGCCGCTGTG GTACCGCCGC
TTCCTCGACA TCACGGACGC GGACGCCCAT CCGTTCGACC CGCTGGTGGA CCTCGTCGCC
CTGTCCGACG AACCGCCTCC CCCGGGCTCC CCGGGGCAGC TCGCCGACCA GGCCTGCTGG
CCGGTGCGAG TCGTGGTCGC GGACGACTCG CCGACCGCCG TCGTCGGGCT GGTCGCTCCG
AGGCCGCCGG AGCGGTTCGT CGACGACGAC GGAGCACAGC GCCGGGACCG CACCGCCGCC
CAGCTTCACC TGGATGCCGC GGCGCTGCCG CTGGACCCCG GAAAGGTACC GGCCGCCACC
GACAGCCTCA TCCGGGCCCA GCTGTGCGAG CGGCTGGCCG CCGCCGTCGC CACCGCCCAC
CACTACGGCG TGCGGCTCGG CCGGCAGACG CTCGAGAGCG CGGTCTACGC CCTCGACCCG
GAGCCCGACG TGCTCCTCGT CGACTGCGAC ACCGCGAAGC TCGATCCGTC CGATCAGGCC
AGTCCACAGG AGGACCTGAC CTGGCTGGCG CGGTTCGTCG AACGCTGCGT GGACGACCAG
CAGCTGCCGC CGGTCGCCCT CGGCGAGGAG GCGCCGCCGG TGGTCCTGGA CGCCACCGGC
TGGAAGATGA TCGCCGACGC GAAGTCCAAG GTGGGACTGG CCGTGCCGTC CGCCAGTCGC
TGGCAGCGCT ACCTGGCCGA CCGGGTGCTG GAGCTGCGTG GGCCGCCCAC CGTCACCGCC
GTGCGGGTGA GCCCGGTCCT CGTCCCTCGC GGCGAGAAGG TCACGGTCCG CTGGCGAAGT
CGGTACGCCG AGTCGATGAT CGTGATCAGT CCGGACGGCA AGCAGATCCA GGTGCCGGCG
AAGCAGCTCG CGGACGGCGC CGCGCGCATG ACCGTGACCG CCGCCGGGCC GGTCCGGTTC
CGGGCGGTCA ACCAGGTCGG CACGACCGAG CTGGCCAGCG ACTGGATTCA CGTCTTCGAC
CTCCCGACGG GCGCGGATGT CGACTATCCG AAGATCTCGA ACCTGCCGGC GATCTGGCTC
GACGGCATGA TCATGAACAC CTGGGCCTTC GACGACACGA ATTTGGCCGC GATGCTGCCG
GCGATACCCG GCGGTGCCGG TAACAGCGAG GGCAGGGGGC GCGCCGGCCA CGTCGGCCCC
GTAGGCCGGT CGGGTCGGGT CGGCCGGGCC GGCGGGCGTG GCGGCGACCC CGCGGCCTCG
GTGCCGGGGC GCGCCGAGTT CCCGATCGAC CCGACGACCT GGTTCGCCAA CCCGCCGGAG
ATTCCCCGAC GCGGCCGCGC GAGGAGATGG AAACTGCCAT GGACGTGA
 
Protein sequence
MSQARQIPTR QPLYVVSDGS RRMDSARLTA ANLLLGHLVD GVTSTPELAA AVPLCVQEFD 
QESTTLMRLA PLDPAVPVRE LTASLAEPSL TAAFTGLRAT ATGDLAQLLA DDFEVGQPIV
VVITSGHSRD TEADRLRAFT ELVTTNLWPD QTADWEPSVY LFALDAPDSA MLDRFASSGA
RCRVRPVAVG TDPAPDVADV IRDLELRVQG GAQGMKVARW VLSVAQLGST APTARVDGVE
IHEIPELTLP WVTKPLWYRR FLDITDADAH PFDPLVDLVA LSDEPPPPGS PGQLADQACW
PVRVVVADDS PTAVVGLVAP RPPERFVDDD GAQRRDRTAA QLHLDAAALP LDPGKVPAAT
DSLIRAQLCE RLAAAVATAH HYGVRLGRQT LESAVYALDP EPDVLLVDCD TAKLDPSDQA
SPQEDLTWLA RFVERCVDDQ QLPPVALGEE APPVVLDATG WKMIADAKSK VGLAVPSASR
WQRYLADRVL ELRGPPTVTA VRVSPVLVPR GEKVTVRWRS RYAESMIVIS PDGKQIQVPA
KQLADGAARM TVTAAGPVRF RAVNQVGTTE LASDWIHVFD LPTGADVDYP KISNLPAIWL
DGMIMNTWAF DDTNLAAMLP AIPGGAGNSE GRGRAGHVGP VGRSGRVGRA GGRGGDPAAS
VPGRAEFPID PTTWFANPPE IPRRGRARRW KLPWT