Gene Tery_1933 details


Gene Information

Locus tag: Tery_1933
Symbol:
ID: 4242682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2998586
End bp: 3000205
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 34%
IMG OID: 638107054
Product: hypothetical protein
Protein accession: YP_721661
Protein GI: 113475600
COG category:
COG ID:
TIGRFAM ID: [TIGR03187] DGQHR domain
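
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based, inclusive coordinates (the convention under which End bp minus Start bp plus one equals the listed gene length) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2998586, 3000205)
print(length)                  # 1620
print(protein_length(length))  # 539
```

Both values agree with the Gene Length and Protein Length fields of this record.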


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.258826
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0613304
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAAA AATTAATGAT AAAAAAGGCC AATAAACCGA CATCAGAAAT AGCAAAAGAA 
ATTTTAGAAA ATGATAATCG AGAAAAAGAA GCGATCGCTA TCCTATTAGA CAAACATATA
GGCAAAGACA ATAGATTATT AGTACAAAAA ACCATGATGG GAAACACAGA AGCTTACATT
GGTTCGGTCA CTCTAGAATG GTTAGATAGT CGTGTACGCT TTGCCTCTCA ACTACCATTA
TTTAGGCAAA AATTTGACAT AGAAACTGAT AATATAATTC GGGATTCAGA AACAATAGAT
GAAATTCAAC AACGACCCTT AGACTGGTCT CGTCAAGCTC CATTAACTTT ATATTTAGCA
ACTAGAAAAT CCCATAAATT TCCGGCAGTA TTAGTAGTAA TTAGTCCGAG TTGGGTAGAT
AATCCTAAGG CAGAAGAATG GAATAAAAAT GGTGAAGCAA ATAAATCTGC AACTGATTTT
TTTTCCCTAG ATTCACAGGG AAAAGTAGGA TTATTAGACC TACGTTTAGA AGTAGCAGTA
TTTGCCTTAG ATGGTCAACA TAGACTGATG GGAATTCAAG GATTAATGGA ATTAATAAAA
ACTGGTAGAT TACCAAGATA TAACAAACAA AAGAAATCAG TAGGTGCAGC AATTACTATT
GATGATTTAA CTATAACTCA TCAGATTGAA TTACCAGAAA TACAAAAATT AGCTTACGAA
CAAATAGGAA TAGAATTTAT TCCAGCAGTA GTAAAAGGGG AGACAAGAGC ACAAGCACGA
CGCAGAGTTA GGTCAGTTTT TGCTCATGTA AATTTGACAG CAGTAAAATT AAGTAAAGGG
CAATTAGCAT TATTAAATGA AGATGATGGA TTTGCTATTG TAGCGAGAAA AATAGCAATT
TATCATCCTA TTTTAAAGGA GAAAGATGGT AGAAATCCAA GAATAAATTG GGATAGTGCA
ACTGTGGCAG CTAACTCTAC TGTTTTAACT ACTCTCCAAG CATTGCAAGA AATGTCTGAA
AGATATTTAA AACCTCGTTA TCCCTATTGG AAACCTTCAG ATAGAGGTTT AATTCCTATG
CGTCCTGCAG AAGAAGAGCT AGAAGAGGGG GTAGAAGAAT TTATGGTACT TTGGAATTAT
TTGTCTAATT TGCCTAGTTA TTCTAGATTA GAAAATGGCT CTGAAACTTC GGAATTAAGA
AGATTTAGTT TTGAGAGAAA ACCGGGAGAA GGTCATGTTT TGTTCCGCCC TATTGGGCAA
ATTGCTTTTG CTGAAGCTTT AGGGATTTTA ATATATAAAA AAGAATTTTC TCTCAAAGAA
GTTTTTCATA AATTAAATAA GTATGATGTG GATGGTGGTT TGAGTGGAAT AGAATTTCCT
GACTCAATTT GGTATGGGGT TTTATATGAT TTTAATCGGA AACGAATGTC GGTAGCTGGT
AGAGATTTAG CAATGAGATT ATTTATCTAT ATATTAGGTG GAGTTTCTGA CAAAATGGAG
CGGGCAGAAG TTCGTCGGCA GTTGGCGCAA GCAAGACGAG TTGGAGAGGA TCAAGCTGTA
GATTTTCAGG GTAAGTTTGT CGAATTAAAA AAAGTAGGAT TACCTGAAAT TTTATATTAA
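
The GC content field can be recomputed from the gene sequence. The sketch below is one way to do it, assuming the sequence is pasted as text in the 10-base groups shown above; `gc_percent` is a hypothetical helper name. Only the first row is used as sample input, so its result describes those 60 bases and not the whole gene.

```python
def gc_percent(seq_text: str) -> float:
    """Percent G+C in a sequence given as text with spaces or line breaks."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence only; paste the full block to check the
# record's overall GC content figure.
first_row = "ATGTCCAAAA AATTAATGAT AAAAAAGGCC AATAAACCGA CATCAGAAAT AGCAAAAGAA"
print(round(gc_percent(first_row), 1))
```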
 
Protein sequence
MSKKLMIKKA NKPTSEIAKE ILENDNREKE AIAILLDKHI GKDNRLLVQK TMMGNTEAYI 
GSVTLEWLDS RVRFASQLPL FRQKFDIETD NIIRDSETID EIQQRPLDWS RQAPLTLYLA
TRKSHKFPAV LVVISPSWVD NPKAEEWNKN GEANKSATDF FSLDSQGKVG LLDLRLEVAV
FALDGQHRLM GIQGLMELIK TGRLPRYNKQ KKSVGAAITI DDLTITHQIE LPEIQKLAYE
QIGIEFIPAV VKGETRAQAR RRVRSVFAHV NLTAVKLSKG QLALLNEDDG FAIVARKIAI
YHPILKEKDG RNPRINWDSA TVAANSTVLT TLQALQEMSE RYLKPRYPYW KPSDRGLIPM
RPAEEELEEG VEEFMVLWNY LSNLPSYSRL ENGSETSELR RFSFERKPGE GHVLFRPIGQ
IAFAEALGIL IYKKEFSLKE VFHKLNKYDV DGGLSGIEFP DSIWYGVLYD FNRKRMSVAG
RDLAMRLFIY ILGGVSDKME RAEVRRQLAQ ARRVGEDQAV DFQGKFVELK KVGLPEILY
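
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). `translate` and `CODON_TABLE` are illustrative names, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq_text: str) -> str:
    """Translate a DNA string (spaces and line breaks ignored) codon by codon."""
    seq = "".join(seq_text.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    # Unknown codons (for example, ones containing N) become "X".
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


first_row = "ATGTCCAAAA AATTAATGAT AAAAAAGGCC AATAAACCGA CATCAGAAAT AGCAAAAGAA"
print(translate(first_row))  # MSKKLMIKKANKPTSEIAKE
print(translate("TAA"))      # *
```

The first call reproduces the first 20 residues of the protein sequence above, and the second shows that the gene's final codon, TAA, is a stop, which is why the protein is one residue shorter than the codon count.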