Gene Tery_4135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4135 
Symbol 
ID4245649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6379583 
End bp6380458 
Gene Length876 bp 
Protein Length291 aa 
Translation table11 
GC content41% 
IMG OID638109036 
ProductFe-S cluster assembly protein NifU 
Protein accessionYP_723616 
Protein GI113477555 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0694] Thioredoxin-like proteins and domains
[COG0822] NifU homolog involved in Fe-S cluster formation 
TIGRFAM ID[TIGR02000] Fe-S cluster assembly protein NifU 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000225432 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGAAT ATACTGAAAA GGTAATGGAT TTGTTCTATA ACCCCCAAAA CCAAGGAACT 
ATTACAGACA AAAAAGAGGG GGAAAAGATA GTTAGTGGTG AAGTAGGAAG TATAGCCTGT
GGAGATGCTT TAAGTCTACA CCTCAAAGTA AATGAAGCCT CCGGTGAAAT ATTAGATGCC
AAATTTCAAA CCTTTGGCTG TGCAAGTGCG ATAGCTTCAT CTTCTGCCCT AACAGCAATG
CTCAAAGGCA AAACCATAGA CGAAGCCATG AATATTAAAA ACCAGGATAT TGCCGGATAC
CTGGGAGGGC TGCCAGAAGA AAAAATGCAC TGTTCAGTCA TGGGAGAGGA AGCATTAGAA
GCGGCAATAT TTAAGTACAA AGGTATTGAA GTAGAAGTTC ACGAAGAAGA CGACGAAGGA
TCATTAGTTT GTAGTTGTTT TGCGATAACA GAAAACAAGA TTAAGCGAGT TATTTTGGAA
AACAATCTCA AAACAGCGGA AGAAGTAACA AACTATGTCA AAGCTGGTGG TGGTTGTGGT
TCTTGTCTGG CAGATATTGA TGATCTCGTC GCATCGGTTT ATGAAGCGCC AGACACTACA
ACGCAACAAA TTCCTACAAC TACTAAACCA GCAACCAACC TGACAAACTT GCAAAAAATT
ACATTAATTC AGCAAGTATT ACAACAAGAG GTGAGGCCAG TTCTCGCCGA AGATGGAGGA
GATGTTGAGT TATTCGATGT AGATGGCGAT CGCGTGCTAG TCAAACTCAA AGGAGCTTGT
GGTTCTTGCA GTAATGTGCT AGTAACGCTA AAAGGAGCGA TCGAAGCTAC ATTAAAAGAA
CGAGTTAGTG AAAGTCTTGT AGTAGAAGCG GTATAA
 
Protein sequence
MWEYTEKVMD LFYNPQNQGT ITDKKEGEKI VSGEVGSIAC GDALSLHLKV NEASGEILDA 
KFQTFGCASA IASSSALTAM LKGKTIDEAM NIKNQDIAGY LGGLPEEKMH CSVMGEEALE
AAIFKYKGIE VEVHEEDDEG SLVCSCFAIT ENKIKRVILE NNLKTAEEVT NYVKAGGGCG
SCLADIDDLV ASVYEAPDTT TQQIPTTTKP ATNLTNLQKI TLIQQVLQQE VRPVLAEDGG
DVELFDVDGD RVLVKLKGAC GSCSNVLVTL KGAIEATLKE RVSESLVVEA V