Gene Tery_3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3126 
Symbol 
ID4244256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4782899 
End bp4783945 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content32% 
IMG OID638108138 
Productaldo/keto reductase 
Protein accessionYP_722731 
Protein GI113476670 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATATA AACAACTAGG TGATAGTCAA TTACAGGTAT CAGAAATTTG TTTAGGTACC 
ATGAATTATG GTAAACAAAA TACCTTAGAC GAAGCACAAA AATTACTAGA TTATGCTTTT
TCTCAAGGAA TAAATTTTAT TGATACTGCA GAAATGTATC CTGCTCCTAC CTGTGCCAAA
ACTCTAGGTA AAACAGAAGA ATATATTGGT AAATGGTTAA GTAAAAAACA ACGAGACAAA
GTTACTATCG CAACTAAGGT TTGTGGTAGA CCAAATGAAA CTTTATCTCT AAATTGGATT
CGAGAAGGAA AAAGTTGTAT CAATGATGTT AATGTTCAAA AAGCTATTGA TGGTAGTCTT
TCAAGATTAC AAACTGACTA CGTTGATTTG TACCAAATTC ACTGGCCAGA TCGTTATGTT
CCTTTATTTG GTGCATCTGA TTACAATCCT CGTTATGAAA GGGAAACAAT ACCAATTATT
GAACAATTAG AAGTTTTTGC CGATCTAGTT AAAGCTGGAA AAATTCGTTA TTTAGGTATA
AGCAATGAAA CTCCTTGGGG AGTATCTGAA TTTTCTCATT TAGCACAACA ATTAGGATTA
CCAAAAATTG TTTCGATTCA AAATGCTTAT AATTTAACAA ATCGAGTTTT TGAAATAAAT
CTAGCAGAAA CTTGTCACTT TCATCGAGTT GGATTAATGG CTTATAGTAC CTTAGCTTTT
GGCTACTTAT CAGGTAAATA TTTGTCTAAA ATTCCCAAAA ACTCAAGATT AGATTTATTT
CCTGGGTTTG ATAGACGTTA TCATAAACCT AATTTTGCCG AAGCAGTAAA AGCTTATGTA
GATATTGCTC ATCAATATGA GCTTACACCT GTTCAGTTAG CACTAGCTTT TGTTCGTTCT
CGTTGGTTTG TTACTACTAC TATTATTGGA GCATCAACAA TAGAACAACT TCAGGAAAAT
ATTTCCAGTA TAGAAATAGA ATTAGACTCA GATATATTAG CACAAATAGA TCAAATTCAT
GCCCGTTATC CTAATCCAAC TTCCTAA
 
Protein sequence
MRYKQLGDSQ LQVSEICLGT MNYGKQNTLD EAQKLLDYAF SQGINFIDTA EMYPAPTCAK 
TLGKTEEYIG KWLSKKQRDK VTIATKVCGR PNETLSLNWI REGKSCINDV NVQKAIDGSL
SRLQTDYVDL YQIHWPDRYV PLFGASDYNP RYERETIPII EQLEVFADLV KAGKIRYLGI
SNETPWGVSE FSHLAQQLGL PKIVSIQNAY NLTNRVFEIN LAETCHFHRV GLMAYSTLAF
GYLSGKYLSK IPKNSRLDLF PGFDRRYHKP NFAEAVKAYV DIAHQYELTP VQLALAFVRS
RWFVTTTIIG ASTIEQLQEN ISSIEIELDS DILAQIDQIH ARYPNPTS