Gene Tery_4990 details


Gene Information

Locus tag: Tery_4990
Symbol:
ID: 4246645
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7630504
End bp: 7631664
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 37%
IMG OID: 638109801
Product: radical SAM family protein
Protein accession: YP_724377
Protein GI: 113478316
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID:
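
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(7630504, 7631664)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.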


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.396764
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.819586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAA ACACCCATAA AATACCCCCC CGTAACAATA CAAAAACTAC CACCCCAACC 
AAAAATCGGG GAATGTCAAC CACAACCAGC ATCCTAGATA CCCTACAGCT AGTTTTTCAA
GCAGGAGGGC CAGGATTTTG TCAATTTGCC ATTAATAACG CTTGTAACGC CAACTGCGGT
TTCTGCAACT TTGCAAGAGA CACCTTTCCT AAAGAAAAAA CAAAATTTGT TAACCTTAAT
GAAGGATTAG ACTCAATTAA TATTCTATTT CGCGAAGGTA TCAGATATCT AGTTTTCACA
GGTGGAGAAC CAACCCTTAA CCCCAACTTA ATCTCATTTG TTGACCATGC AACAAGATTA
GGAATCAAAG TTATGGTAGT AACAAACGGT GGGTTATTAA CCCCCAAAAA AATTGAGGCA
TTAGCAGACG CAGGTTTATC TAGTCTAATC ATCTCAGTTG ATGCCGCCGC TCAAGACATC
CATGAAAAAA ATCGTGGGTT ACCTGGTGTC TGCAATAAAA TAAAAGAAGC AAATCAAATC
CTCAAAAATA TTGGTATGTA TTCCACCGCT TCCGTCACAA TGAGTCATTT AGTTGACTAT
GATGCATTGC CAGATTTTCT CACCTCCCTA GGATTTGAAT CAGTAGTTTT TTCCTATCCA
TTAACAAATC TTGATTCTAA CTTTTTAGGA TATTCTGATG CAGGTTTAGT CACTCATACC
AATGAAAGTC TCTGGCAAGC CTACGAAAAA GTCAAGCAAT TAAAAAAGCG ATTTCGTGTA
GTTAACCCCA CTCTTTCTTT AGAAGAAATG CAGAGATTCG TCAAAAATGA AGAACAGCGT
TATTCTTGTT TAGCAGGTTA TCGTTTCTTC TTTCTTGACT GGGAATTAAA CTTATGGCGT
TGTCATTTTT GGCATGAACC AATGTGTTCT ATTTATGAAT TTGATAGCTC CAAATTAGTC
AGAGATAATT GTACAAAATG TATGATTAAT TGTTATCGAG ATTCTAGTTT AATGCAGAAT
ATTGCCGTAT CAATGCAGGA TGCTTATCAG TCAATTAAGA AGGGAAATTT ATTAGATACA
GCAAAAGCTT TAACAAGAAA AAGTAATGTT GATTCCCTGC GAGCAGTGGT AGAAGAATTA
CCTTGGATAA TGCGTTTTTA A
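
A GC content figure like the one in the Gene Information block can be recomputed from a sequence printed in 10-base groups. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases. The database's exact rounding rule is not stated, and `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 10-base group of the gene sequence: one G and one C in ten bases.
print(gc_content("ATGAAAACAA"))  # 20.0
```

Passing the complete gene sequence, pasted as a single string, gives the figure to compare against the reported GC content.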
 
Protein sequence
MKTNTHKIPP RNNTKTTTPT KNRGMSTTTS ILDTLQLVFQ AGGPGFCQFA INNACNANCG 
FCNFARDTFP KEKTKFVNLN EGLDSINILF REGIRYLVFT GGEPTLNPNL ISFVDHATRL
GIKVMVVTNG GLLTPKKIEA LADAGLSSLI ISVDAAAQDI HEKNRGLPGV CNKIKEANQI
LKNIGMYSTA SVTMSHLVDY DALPDFLTSL GFESVVFSYP LTNLDSNFLG YSDAGLVTHT
NESLWQAYEK VKQLKKRFRV VNPTLSLEEM QRFVKNEEQR YSCLAGYRFF FLDWELNLWR
CHFWHEPMCS IYEFDSSKLV RDNCTKCMIN CYRDSSLMQN IAVSMQDAYQ SIKKGNLLDT
AKALTRKSNV DSLRAVVEEL PWIMRF
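
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1 up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAACAA ACACCCATAA AATACCCCCC"))  # MKTNTHKIPP
# Last 21 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CCTTGGATAA TGCGTTTTTA A"))  # PWIMRF
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.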