Gene Tery_1027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1027 
Symbol 
ID4243106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1602686 
End bp1603732 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content35% 
IMG OID638106263 
Productcobalamin biosynthesis protein CobW 
Protein accessionYP_720875 
Protein GI113474814 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID[TIGR02475] cobalamin biosynthesis protein CobW 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.959481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCTA AAATTCCAGT TACTGTAATT ACGGGTTTTC TCGGTAGTGG TAAAACTACG 
ACAATTCGCC ATTTATTAAA AAATAATAAA GGTCGTCGCA TTGCTGTTTT AGTGAATGAA
TTTGGTGAAG TTGGTATAGA TGGAGATTTA TTACGTTCTT GTCAAGTTTA TGATGAGGAA
GGTATTATTA ATAATATTGT TGAACTAAAT AATGGTTGTC TTTGCTGTAC TGTGCAAGAG
GAATTTTTTC CGACTATGCA AGAACTATTA AAACGTCGAG AAAAAATTGA TTCTATTTTA
ATTGAAACTT CTGGGTTAGC TTTACCAAAA CCTTTGGTAC AAGCATTTCG TTGGCCTCAA
ATTAAGACTT CTGCTACGGT GGATGGAGTT GTTACTGTTG TTGATTGTGA AGCTGTGGCA
AATGGTAGTT TGGTTGGAGA TATTGATGCT TTGAAAGCTC AACGTCAAGC TGACCCAAAT
TTGGATCATG AAACCCCTAT TGAGGAATTA TTTGAAGATC AGTTAGCTTG TGCTGATCTG
GTTTTATTAA CTAAGGTTGA TATGGTGGAT GAAGCTACTT CTGATAAAGT ACAAAATTGG
TTGAGGGAGC ATTTGCCTAA AACTGTGAAA ATAGTTCCTT GTATTGGAGG TGAAATTAAT
CCAGATTTAT TGTTGGGTTT TAATGCTGTA GTTGAAGATA ATTTAGATTC TCGTCCTAGT
CACCATGATA CTCAAGAAGA ACATGAACAT GATGATGAAA TTAATTCTGT ACATTTAATT
TTGGATGAAG AGTTTGAACC CCAAGGGTTA GTTGAAAAGT TGAACGGTTT AGTGACAAAT
TCTGAAATAT ATCGGATTAA AGGTTTTGTG GCAGTGCCAA ATAAGTCTAT GCGTCTGGTT
TTGCAGGGGG TGGGTTCACG CTTTGATTTT TTCTATGACC GTCTCTGGCA AAAACAGGAG
ACTAGGCAAA CTAAGTTAGT TTTAATTGGT CGTTCTCTAC AAAGAGAAAA AATTTACTCC
GAGCTGGTTT CTAATTTCTC TAATTAA
 
Protein sequence
MSAKIPVTVI TGFLGSGKTT TIRHLLKNNK GRRIAVLVNE FGEVGIDGDL LRSCQVYDEE 
GIINNIVELN NGCLCCTVQE EFFPTMQELL KRREKIDSIL IETSGLALPK PLVQAFRWPQ
IKTSATVDGV VTVVDCEAVA NGSLVGDIDA LKAQRQADPN LDHETPIEEL FEDQLACADL
VLLTKVDMVD EATSDKVQNW LREHLPKTVK IVPCIGGEIN PDLLLGFNAV VEDNLDSRPS
HHDTQEEHEH DDEINSVHLI LDEEFEPQGL VEKLNGLVTN SEIYRIKGFV AVPNKSMRLV
LQGVGSRFDF FYDRLWQKQE TRQTKLVLIG RSLQREKIYS ELVSNFSN