Gene Tery_4535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4535 
Symbol 
ID4246189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6997007 
End bp6998191 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content36% 
IMG OID638109412 
Productaldo/keto reductase 
Protein accessionYP_723988 
Protein GI113477927 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0927734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00892549 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATGCAAT ATCGACGCTT TGGGCGCACA GAATTATCAA TACCAGTGTT TTCCTGTGGC 
GGAATGAGGT ATAAATATAA ATGGCAAGAT GTTCCTAAAA ATGAAATCCC ACTTAATAAT
CAACAAAACT TAGAAAATAC AATCCGTCGC TCTCTAGAAT GTGGGATTAA CCACATAGAA
ACGGCCCGTA ATTATGGCAC ATCTGAAATG CAACTGGGAG AAATTTTACC TCAACTACCA
CGGGAAAAAT TGATTATCCA AACAAAAATT AGCCCAAGCG TTGACTCCCA AGAATTCAAA
TCCAAGTTTG ATCAGTCTCT CCATTTTTTA CAACTAGAAT ATGTTGACTT GCTGGCCATA
CATGGGATTA ACACGTTAGA ACGTCTTGAC TATTCTATTA GACCAGGAGG TTGTTTAGAT
ATAGTTAGAA AATTACAAGA GCAGGGAAAA GTCAGGTTTG TTGGTTTCTC TACTCATGGG
CCAACTGATG TAATAATTAA AACTATAGAA ACCAATCAAT TTGACTATGT TAACCTACAC
TGGTACTACA TTAATCAGGA GAATTGGTCC GCAATAGAGG TTGCTAATAA GTTTGATCTG
GGAGTATTTA TTATTAGTCC TTCTGATAAG GGTGGTAAAT TGTATCAACC GCCACAAAAA
TTAATAGATT TGTGCTATCC ATTAAGCCCA ATGGTGTTTA ATAATCTATT TTGTTTGAGT
CATCCCCAAG TTCATACATT GAGTTTGGGA GCTTCAAAAC CAACAGATTT TGATGAGCAC
TTAAAAACAT TGGAATTTTT AGAGAAACCA GATGAGATAT TACAACCAAT ATTAAATAGT
CTAGAAAAAG AAGCGATCGC TAAACTAGGA GAAAATTGGT ACCAAACTTG GCATATTGGT
TTGCCTACTC CAGAAAATAC TCCAGGAAAT ATTAATATTC CTGTGATTTT ATGGTTAAGA
AATTTAGCGA TCGCCTACGA TATGTGGGAA TATGCTAAAG TACGCTATAA CTTATTGGGC
AATGGTAGTC ATTGGTTTCC TGGTGCAAAT GCTGAACAAG TAGAAAAATA TAACTTGAGT
AAATTTCTTG TGAATAGTCC TCATGCTGAT AAAATTCCAG ATATTCTTCA AGATGCTCAT
CAATTACTGG TAGGGACTCC AGTAGAACTT TTGTCTAGCA CTTAA
 
Protein sequence
MMQYRRFGRT ELSIPVFSCG GMRYKYKWQD VPKNEIPLNN QQNLENTIRR SLECGINHIE 
TARNYGTSEM QLGEILPQLP REKLIIQTKI SPSVDSQEFK SKFDQSLHFL QLEYVDLLAI
HGINTLERLD YSIRPGGCLD IVRKLQEQGK VRFVGFSTHG PTDVIIKTIE TNQFDYVNLH
WYYINQENWS AIEVANKFDL GVFIISPSDK GGKLYQPPQK LIDLCYPLSP MVFNNLFCLS
HPQVHTLSLG ASKPTDFDEH LKTLEFLEKP DEILQPILNS LEKEAIAKLG ENWYQTWHIG
LPTPENTPGN INIPVILWLR NLAIAYDMWE YAKVRYNLLG NGSHWFPGAN AEQVEKYNLS
KFLVNSPHAD KIPDILQDAH QLLVGTPVEL LSST