Gene Tery_2077 details


Gene Information

Locus tag: Tery_2077
Symbol:
ID: 4245725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3243857
End bp: 3245767
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 39%
IMG OID: 638107188
Product: hypothetical protein
Protein accession: YP_721791
Protein GI: 113475730
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID:
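
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3243857
end_bp = 3245767

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1911
print(protein_length)  # 636
```

Under these assumptions the computed values agree with the reported gene length (1911 bp) and protein length (636 aa).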


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.189777
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.779158
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAATC AAAATATAGC AATTACTATA GGAGTTCAGG AATATGAGTT TTTGACTCCT 
CTAAAATATG CAGCTAATGA CGCTAAAAAA ATGAGGGATT TTTTGCTCGA CGAAGCAGAT
TTTGATGACG TTTTTTACTT GTCAGATAAT TCTCCAAAAA TTAATGGTGC TTCAACTCGC
CCAACTCGTT CCCGCTTAGA ATTGGTGTTA GAAGATGAAG TTAAAAAACT ATCTCTGAAA
ACTGGTGATA ACCTTTGGTT TTTTTTTAGC GGCCATGGTC ATAGAGAGAA TAATAATATT
GATTATTTAA TTCCTATTGA TGGTCATTCA AATGTTGAAA GAAGTGGAAT TTCTGTTGAC
TATATTATTC AACAACTGCA AAAATCTGGA GCAGATAATA TAGTTCTAAT ATTGGATGCT
TGTAGAAACA AAAGTGATGG GGGAAAAGGG GGAGAAGGAC TAGGAAGGCA AACAGAACAA
GAAGCTCGTG AAAAGGGGAT AGTAACTATT TTTTCTTGTA GTCCAAATGA ACGTTCTTGG
GAATTAGAAG AGTTGCAACA GGGAGTTTTT ACTTATGCGT TATTAGAGGG GTTAGGTAGT
AAGGGTCAAA AAGCAACTGC AGAAAGACTG AATGAATATC TGAAGTATCG GGTGCAAGAG
CTAGCTCAAA AGGAAGGAAA ACGACAGACC CCTCGTATTA TTGCTGACCC TATTGAAAAG
TCTCATTTAA TTTTAATGCC GAAGTATGCA ACAAAAACTG ATGTTTCTAC TCTGAAAATT
GACGCTTACA GAGCTCAAAC AAATGGAGAT TTTAATTGGG CAGAACAACT ATGGATAAGA
GTTTTAGAGG TAGGTTTGGA TCTTGAAGCA GTTAAGGCTC TGCAGAAAAT TGCTGTTGAT
CGGTTTAGAT CTCAGTTGGT TTCCACCCCA CAGTTAGATC TAGGTTATTT TCAGAATTTA
CCGGTTTCAG AAACTCAACA ACCAAGATCA ACAGAAGTAT TAGAAGGTTC AAATATTCAA
ACTTCACTAC AGTCACAACG GAAGTTAGAA CCAAACTCAC AAATAAAAAC AAAGTCACAG
CCAAAAATTA CTATTCATAC ATTTGCGACT CCTAAAGTTA ACAGGAAAGG AGAAATAATA
AGCCGCTCTG AAGGTGAAGC AGAAGTAATG ATAGAAAATT TAGGTAATGG AGTTACTCTG
GAAATGGTAA AAATACCCGG TGGTAGTTTT CTGATGGGGT CTCCAGAGAC GGAAGCACAG
AGAAGTGATA ATGAAGGTCC GCAACATCAT GTAGATGTGC CAGAATTTTG GATGGGAAAG
TATGTCGTTA CTCAACAACA GTGGCAAGCA ATAATGGGAA ATGATCCTTC GAAATTTAAA
GGAAAAAATC GCCCTGTGGA AAGAGTAAGT TGGAATAACG CTACAGAATT TTGTCAAAAG
CTCTCTAAGA AAACAGGAAG AGACTATAGA CTACCGAGTG AAGCAGAATG GGAATATGCC
TGTCGTGCTG GGACAACTAC ACCTTTTTAT TTTGGAGAAA CTATCACAGG AGAATTAGCT
AATTATAGAG CTTCAGAGAC TTATGCTGAT GAACCAAAAG GAGAATATAG AGAACAAACA
ACTCCTGTAG GTGAGTTTCC ACCTAATGCT TTTGGTCTAT ATGACATGCA TGGGAATGTC
TGGGAGTGGT GTCAGGATGT TGTGCATAGT AATTATGATG GAGCACCTGT TGATGGAAGT
GCTTGGGTAA ATGGAGGCGA TAGTAGCGGT AGAGTGCTTC GTGGCGGCTC CTGGCTCAAC
TATCCTGGGT GGTGTCGCTC TGCGAGCCGC GGCTACTATG TCTCGGTCGT GGTGGTCAGC
TCCAATTTTG GTTTTCGTCT TGTGAGTTTC CCCCCCAGGA CTCCTGAATA G
 
Protein sequence
MTNQNIAITI GVQEYEFLTP LKYAANDAKK MRDFLLDEAD FDDVFYLSDN SPKINGASTR 
PTRSRLELVL EDEVKKLSLK TGDNLWFFFS GHGHRENNNI DYLIPIDGHS NVERSGISVD
YIIQQLQKSG ADNIVLILDA CRNKSDGGKG GEGLGRQTEQ EAREKGIVTI FSCSPNERSW
ELEELQQGVF TYALLEGLGS KGQKATAERL NEYLKYRVQE LAQKEGKRQT PRIIADPIEK
SHLILMPKYA TKTDVSTLKI DAYRAQTNGD FNWAEQLWIR VLEVGLDLEA VKALQKIAVD
RFRSQLVSTP QLDLGYFQNL PVSETQQPRS TEVLEGSNIQ TSLQSQRKLE PNSQIKTKSQ
PKITIHTFAT PKVNRKGEII SRSEGEAEVM IENLGNGVTL EMVKIPGGSF LMGSPETEAQ
RSDNEGPQHH VDVPEFWMGK YVVTQQQWQA IMGNDPSKFK GKNRPVERVS WNNATEFCQK
LSKKTGRDYR LPSEAEWEYA CRAGTTTPFY FGETITGELA NYRASETYAD EPKGEYREQT
TPVGEFPPNA FGLYDMHGNV WEWCQDVVHS NYDGAPVDGS AWVNGGDSSG RVLRGGSWLN
YPGWCRSASR GYYVSVVVVS SNFGFRLVSF PPRTPE
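
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; the alternative start codons of table 11 are not handled. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon order used by NCBI genetic code listings: T, C, A, G at each position.
BASES = "TCAG"
# Amino-acid assignments for translation table 11 (identical to table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCAATC AAAATATAGC AATTACTATA"))  # MTNQNIAITI
```

Passing the full gene sequence to `translate` should give a protein that can be compared with the listed protein sequence, and `gc_percent` can be compared with the reported GC content in the same way.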