Gene Tery_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3901 
Symbol 
ID4243564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6027159 
End bp6028307 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content33% 
IMG OID638108827 
Producthypothetical protein 
Protein accessionYP_723409 
Protein GI113477348 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTACAT ACATACTGGC CTTAGCTGTT GGTCTTGGTA GTTTAGCTTT GTATATAGTG 
GCCTTTTTCT TTCCAGAGGT CCATCGGAAA AACGATTTTA TCTGGAGTGG TGTAGGTCTT
TTCTATGCCT TGGTATTGTG GTTTTGTGCA GGTAGAATTA CAGGTGCGGT GTTACTGGGT
CAAGTGGCAA GTGTAGCTTT ATTAGGTTGG TTTACTTCAG AAAGTTTAAT GTTGCGTCGC
CAAGTGACTC CAGTTGTAGA GCAAACCAAA ATATCAACTG AGAAAAGTAC TGAAGATTAT
ACTCAGAAAA AATCAAAAAT TGTTTCTGAA ACAACATCTA TATCAGAAGT AGAGAATATG
GAAAAATTAG ATTTATCTGA TTCCCCAATA ACCTCTACCA AACAGTCAGA AAATATTACT
ACTGAAGAGT TGGTGAATAT TGTTAAAACA GGAGAAACTG AATCTGAATT ACTATCATCT
GAAACTACTT CAGACTTAAG TAAAATAATC AAAGAATCTG AAGCAGAGAC AACTGAAAGT
ATGACAGAAA AAAATGTTTT AACTGATGCT AAAACTGAAT TAGATACATC TCAGAAATTA
GATAAAAGTT TATCTAAAAA AGCTCGTGGT TTTGCTCAGT TATTGACACC TATGAGTGGA
ATATTGAGTA ATATTAAAAA TGTCATTCAA GGTAGAGATA ATAAAAATAC TGATTCTGAC
TCAATATCTA CACAAAATCA AGCTGATACT GAGAAATTAA CTTCTATTGA GGAGGTAAAT
ACTGAAGTTA ATGAGACTAT AAGTCAGACA GAAGATACTC AAGCTAAACA AGAATCTTTA
ATAGAGAAAG AAGAGTCTAT AGTATCTGAT GTTAAAACAG ATAAAACTAC TTTGACAGAA
GTTGAAAAGG AAGCAAACTC GTCTTCAAAA TTAGAATCTA CACCTACTGA AAAACCTGCT
ACGGAAACTT CTAAATTAGC TGAAGTTTCT GCGCTTGAAG ATAGCTCTTC TTCACCAGAA
ATAATAACTA CTCAAGATAG TCAGAATCAG GAAGAAAATT TGACTGCTAT TTCTTCTGAG
GAGAAAAATG AGACTGATAA TTCAACATCA GATTTATCAA AAGATAGTCA AAATAAGTCA
GTAGATTAG
 
Protein sequence
MLTYILALAV GLGSLALYIV AFFFPEVHRK NDFIWSGVGL FYALVLWFCA GRITGAVLLG 
QVASVALLGW FTSESLMLRR QVTPVVEQTK ISTEKSTEDY TQKKSKIVSE TTSISEVENM
EKLDLSDSPI TSTKQSENIT TEELVNIVKT GETESELLSS ETTSDLSKII KESEAETTES
MTEKNVLTDA KTELDTSQKL DKSLSKKARG FAQLLTPMSG ILSNIKNVIQ GRDNKNTDSD
SISTQNQADT EKLTSIEEVN TEVNETISQT EDTQAKQESL IEKEESIVSD VKTDKTTLTE
VEKEANSSSK LESTPTEKPA TETSKLAEVS ALEDSSSSPE IITTQDSQNQ EENLTAISSE
EKNETDNSTS DLSKDSQNKS VD