Gene Franean1_5337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5337 
Symbol 
ID5673671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6433394 
End bp6434551 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content69% 
IMG OID641244195 
ProductTIR protein 
Protein accessionYP_001509601 
Protein GI158317093 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.135441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAACG GGGGCAACAC GGACAGCAGC ACCACCAGCG GCGACGGCGA CGGCTGGGAC 
TTCTTCGTCT CCTACACTCA GCCCGACCGG GCATGGGCGG AGTGGATTGC CTGGACCCTG
GAGGAAGCCG GCTGGCGGGT GCTGATCCAG GCGTGGGACT TCACCCCAGG GTCGAACTGG
GTCACCGGCA TGGACGAAGG CGTCGCCGCC GCGGCGCGGA CGATCGCCGT GCTCTCCCAC
GCCTACACCC ACTCGGTCTA CGGCGCCGCC GAATGGCGCG CCGCCTGGGC GGCCGATCCG
ACCGGGGGAC AACGCAAGCT CCTGCCCGTA CGGATCGGTG ACTGCCCCCG ACCGGGCCTG
CTCGGCCAGA TCGGCTCCGT AGACCTCTTC GGCCTGCCGC AGGACCGGGC ACGCACAACG
CTGCTGGACG CGGCACAGCG TGTGGTCTCC GGCGGACGAG CAAAACCAGA CACGGCTCCG
CTGTTCCCCC CGGCCGGACG GGCAGTGCCC ACGCGGCCGT CGTTCCCCGG TAGCCGGCCA
GATGTCTGGA ATCTCCCGCC GCGGCTGGCC CACTTCGTCG GGCGCACCAC CCTCATCGAC
CAGATCGAAC ACGAGCTAGC CCGCGCGGGA TCGGTCGCGG TCTGTGCCCT GCACGGCCTC
GGCGGGATCG GAAAGACCGC CCTCGCCCTC GAATACGCCC ACCGGCATAC GACCGGCTTC
AACCTGGCCT GGTGGATACC CGCCGAAGAT CCACGGCTCA TCCCCGGACA CGTCTCCGCC
CTCGGCGTCG AACTCGGCCT ACCCGACGGC GCGGACTGGC ATGACGTACT TGGTGTGCTG
CGGCGCAAGC AACTGCGCTG GCTCCTCATC CTCGACAACA TCGAAGACCG GACTGTGATC
GGCCCGTTCC GGCCAACAGA TCACCTCGGC CGGCTGCTCG TCACCACACA GCGCGCCGGA
CTCGACGGCT ACGGCACTCA AATCGCCGTA CCCGAACTCC CCCGGCATGA CGCGGTGGAC
CTGCTCACCC GTCGGATACC GAGCATCGAA GTGGGGACGG CCGGACAGAT CACCGATCTC
CTCGGGAACC TGCCCCTCGC GGTGGAACAA GCCGCCAGCG CCCCGTTACA TCCGCGCTGG
CACCGTCCAG CCCGCTGA
 
Protein sequence
MNNGGNTDSS TTSGDGDGWD FFVSYTQPDR AWAEWIAWTL EEAGWRVLIQ AWDFTPGSNW 
VTGMDEGVAA AARTIAVLSH AYTHSVYGAA EWRAAWAADP TGGQRKLLPV RIGDCPRPGL
LGQIGSVDLF GLPQDRARTT LLDAAQRVVS GGRAKPDTAP LFPPAGRAVP TRPSFPGSRP
DVWNLPPRLA HFVGRTTLID QIEHELARAG SVAVCALHGL GGIGKTALAL EYAHRHTTGF
NLAWWIPAED PRLIPGHVSA LGVELGLPDG ADWHDVLGVL RRKQLRWLLI LDNIEDRTVI
GPFRPTDHLG RLLVTTQRAG LDGYGTQIAV PELPRHDAVD LLTRRIPSIE VGTAGQITDL
LGNLPLAVEQ AASAPLHPRW HRPAR