Gene Franean1_0008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0008 
Symbol 
ID5668435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp12333 
End bp14270 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content76% 
IMG OID641238936 
Producthypothetical protein 
Protein accessionYP_001504383 
Protein GI158311875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACA GTCAGAATGA CCGGCCCGCC CGGTCGGCCG TCCACCGCGC ATCCCCGGCC 
CGTCCGGAGC AGCGCACCGC GGCCGTGGCC GGTGGCGCGC CTGTCGGTGG TACCGCCGCC
GGGCGCGCCG CGGCTGGCGG TACTGCCGCC GGCGGTGCCT CTGTGGCTGG TTCCAGCCGG
TCCGGCACGG CGAAGGCCGG GCCCGTGGGC GCGGCACCAG GCGTGGGTTC CGGTGCCGGT
TCCGGAACTG GGATCGGCAA GGGAGTTCCC GGGACAACCG GCTCCGGAAC CGGCGTGAGC
TCGGCGGCGG GGGGCGGTGT GGGTGGCCCC CGCTCGACAC CGGGGAAGGA CGACGGAGAG
TCCAAGCGTG CGTTCCTCCC CGGCACCGGC CGTCCCGCCG ACGGGCGTCC CGGGACCGAT
GGCCGTCCCG CTGACGGCCG CTCGGGGGAG GGCCGTCCGG CCGCGGGGGG CGGGGCTGGA
TCGGGTTCAC GTGGTCCGTC CCGGCCCTCC TCCGGATCGG CGTCGCCCGG AAAGGGTGCC
CCCGGATCGG CGTCGCCCGG CACCTCCGGA TCGGCGTCGC CCGCGCGTGG CGCGGGCGCA
AGCGGTGCGG GCGCGGGCGG GGCGGCGAGC CCGTTCTTCC AGCGTCCCGG GCGGGACGAG
AACGACCCGT CCGACCGGGA CGGTGCGGGT GGCCAGACCA GTCAGACGAC ACGCCTCGGC
GTCGGCGGGA CGGCAGAGCC CAAGCTGCTG AGCGCCTCGG CGACCGTTCC GGGCTCGTCC
TCGGCGGGCA CCGGCGGCGG GACGGACCGC GCCGCCCCCG CGCCGCGCGG CGGTGACTCC
GACCCCGACG AGACATCACG CCAGCCAGCC GGCCGGCAGC CCGCCCGCGA CGCGGACGCG
GCCAGGCCGG CGGGCGACGG CAGGAAGCCG ACCGAGGCGA CCCGCGGGGC CGGCGCGGGC
ACCTCCAAGG ACGCCGCCCC GGCGGCTGGC AAGGGGGCAT CCCGAACGGG CACGCGAGCG
GACGCCGCCC GACCGGACGC CGCTCGCCCG GATACATCCA GAGCCGAGAC GACGAGAAAG
CCCGATGCGC AGACCACCGT CAGAAGCCCG GCGGCACGCC CCGGCACACC GGTCGACCCA
CCCCGGGACA GCGACACGAT CTCGCTGGTG CGACCGAACC TGCCCAAGCG GGGCTCGAAG
CCCCCGGCGG ATCGCGTCGG CGCGGACGTG AAGACCTCCC CGGATCGCGG CGCGGTCACC
GACCGCATGC CCGCCGAGCG CCGCCCCACC CCTGAACCGG TGCGGGCAGC CACTCCGTCG
ACCCGGCAGG CGCCGCTGGC GGGCCGCACA GCCCCCTACG ATCGCCCTGG GACGCCGCCA
GGGCCACTGC CGACGTCCGG AGGTGTGGGC CCGAACGGTC TGTCCACCGA GCCGTTCGAC
CGGGTCGACG ACGCCGACCA CGGCCGGCCC GGCGGTGCTC CTGGACCACA GGGTGGTCCC
CCGCGGGGGC AGCAGCAGCC GGGCGGCCGT GAGCCGGGGC GGGACACCGC CGGCCAGGGC
CCACGCCGGG GCCCCGCCGG TGGCCGGCGT GCCCGCCTGC GGGTCTCCAG GGTGGAACCG
CTCTCGGTGA CCAGGCTCTC GTTCGCGTTC TCGCTGTGCG TCTTCCTGAT CATGATTGTC
GCCGTGGCGG TGCTGTGGTT CGTGCTGAAC TCGATCGGGG TCTTCGACAG CGTCACCAAG
GCCGCTGACA CCCTGACCGA CGGCACGAAC GCCAATGTCT CAGGCTGGCT GTCCTTCGGG
CGGGCGATGC AGGTCACCCT GCTGGTCGGG GCGATCAACG TCGTCCTGAT GACGGCGCTG
GCGACCCTGG GCGCACTGCT CTACAACCTC TGCGCGGACA TGATCGGCGG GCTCGAGGTC
ACCTTGAGTG ACCAGTAG
 
Protein sequence
MSDSQNDRPA RSAVHRASPA RPEQRTAAVA GGAPVGGTAA GRAAAGGTAA GGASVAGSSR 
SGTAKAGPVG AAPGVGSGAG SGTGIGKGVP GTTGSGTGVS SAAGGGVGGP RSTPGKDDGE
SKRAFLPGTG RPADGRPGTD GRPADGRSGE GRPAAGGGAG SGSRGPSRPS SGSASPGKGA
PGSASPGTSG SASPARGAGA SGAGAGGAAS PFFQRPGRDE NDPSDRDGAG GQTSQTTRLG
VGGTAEPKLL SASATVPGSS SAGTGGGTDR AAPAPRGGDS DPDETSRQPA GRQPARDADA
ARPAGDGRKP TEATRGAGAG TSKDAAPAAG KGASRTGTRA DAARPDAARP DTSRAETTRK
PDAQTTVRSP AARPGTPVDP PRDSDTISLV RPNLPKRGSK PPADRVGADV KTSPDRGAVT
DRMPAERRPT PEPVRAATPS TRQAPLAGRT APYDRPGTPP GPLPTSGGVG PNGLSTEPFD
RVDDADHGRP GGAPGPQGGP PRGQQQPGGR EPGRDTAGQG PRRGPAGGRR ARLRVSRVEP
LSVTRLSFAF SLCVFLIMIV AVAVLWFVLN SIGVFDSVTK AADTLTDGTN ANVSGWLSFG
RAMQVTLLVG AINVVLMTAL ATLGALLYNL CADMIGGLEV TLSDQ