Gene Tery_4030 details


Gene Information

Locus tag: Tery_4030
Symbol:
ID: 4242058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6231721
End bp: 6232740
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 39%
IMG OID: 638108940
Product: glyceraldehyde-3-phosphate dehydrogenase
Protein accession: YP_723521
Protein GI: 113477460
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
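
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
# Values copied from the gene record.
start_bp = 6231721
end_bp = 6232740
reported_gene_length = 1020      # bp
reported_protein_length = 339    # aa


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == reported_gene_length)
print("protein length matches record:", computed_protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported 1020 bp and 339 aa.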


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.17049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTAGAG TTGCCATAAA TGGTTTTGGA CGTATTGGAC GGAATTTTTT ACGTTGCTTG 
CTTACTCGTA CAGATAGTCA ACTCGATCTT GTAGCTGTTA ATGATACTTC TGACCCCAAA
ACCAATGCTC ATCTGCTAAA GTATGACTCA ATGTTGGGTA CTTTAAAAGA TGTAGATATT
AGTACGGATG AAAATTCTAT TACAGTCAAT GGTAAAACAA TTAAATGTGT TTCTGACCGT
AATCCTTTAA ATTTACCATG GGCAGACTGG GGTATTGACT TAATTATTGA ATCTACAGGT
GTATTTGTGA CTGAGGAAGG AGCTTCAAAG CATTTACAGG CTGGAGCTAA AAAGGTTTTA
ATTACTGCTC CTGGTAAGGG TGGACAAATT GGTACTTTTG TAGTAGGTGT TAATCATCAG
GATTATAAGC ATGAAGATTA CAATATTATC AGTAATGCTA GCTGTACAAC TAATTGTCTA
GCTCCTATTG TCAAGGTCAT CCATGACAAC TTTGGTATTA TCAAAGGTAC AATGACTACA
ACTCACAGTT ATACAGGCGA CCAACGTATT TTGGATGCTA GCCACAGAGA TGTACGAAGA
GCCAGGGCTG CGGCGGTAAA TATTGTGCCA ACTTCTACAG GTGCAGCTAA AGCAGTAGCT
CTAGTAATTC CAGAAATGAA AGGTAAATTA AATGGTATTG CGATGCGTGT TCCTACTCCT
AACGTTTCTG TGGTAGATTT GGTTGCTCAA GTTGAGAAAA AGACATTTGT TGAACAGGTA
AATGAAGTTA TGGAGGTGGC TGCTAAAGGG CCAATGAAGG GAATCATTGA ATACAGTGAT
TTACCTTTAG TTTCTATCGA CTATCGTGGC CATGATTGCT CTTCAATTGT TGATGCTAGC
CTGACAATGG TTATGGATGG GGATATGGTA AAAGTTATTG CTTGGTATGA TAATGAGTGG
GGTTACAGTC AGCGTGTTGT AGACTTGGCT GAAGTTGTAG CACAGAATTG GGCTGCCTAG
 
Protein sequence
MIRVAINGFG RIGRNFLRCL LTRTDSQLDL VAVNDTSDPK TNAHLLKYDS MLGTLKDVDI 
STDENSITVN GKTIKCVSDR NPLNLPWADW GIDLIIESTG VFVTEEGASK HLQAGAKKVL
ITAPGKGGQI GTFVVGVNHQ DYKHEDYNII SNASCTTNCL APIVKVIHDN FGIIKGTMTT
THSYTGDQRI LDASHRDVRR ARAAAVNIVP TSTGAAKAVA LVIPEMKGKL NGIAMRVPTP
NVSVVDLVAQ VEKKTFVEQV NEVMEVAAKG PMKGIIEYSD LPLVSIDYRG HDCSSIVDAS
LTMVMDGDMV KVIAWYDNEW GYSQRVVDLA EVVAQNWAA