Gene Tery_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3964 
Symbol 
ID4244047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6127659 
End bp6130481 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content48% 
IMG OID638108880 
Producthypothetical protein 
Protein accessionYP_723462 
Protein GI113477401 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.820547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAA ACACAAAAAG TATAGTAATA ATAGATGCTA GTGTAGAAAA CTACCAGCAA 
CTATTAAAGG GAGTCGTTAC GGGAGTTAAA CCTTTTCTCC TCGGCGGCGA CACTGACGGT
ATCCAACAGA TAGGGGATAT CCTCCAAAAA AATCCAGAAA CGGATACCCT TCATATTATC
TCCCACGGTT CTCCTGGTTG TCTGTATCTG GGAAATAGCC AATTGAGTTT GGATACTCTC
AAGGGCTATG AGTCCCAACT GCAACAATGG CAACTAGACA ACCTTCTGCT CTATGGTTGT
AACGTCGCTG CCGGGGATGG GGGGGAAGAG TTTATTGATA AGTTGCATCG GTTGACGGGG
GCTGAGATAG CGGCTTCTAA GTCTTTGACT GGGGCGGCAG TTAAAGGGGG GAACTGGGAG
TTGGAGGTGA GGACGGGTAA AAGCAAGCTC TCTCTAGCAT TGCAAGTAGA AACGATGGCC
AGCTACTCCG ATACCCTCAA CCTGCAATTT GAATGGGCTA AACAGATCGG CGCTAGCAGC
TCTGGCGACG TTAGTAGCAT AACCACAGAC AGCAACGGTA ATGTCTTGGT GGGGGGTCTT
TTTCAGGGCA ACATTGACAT CGACGGCGAT GGGAACAATG ATTTGACCTC TAATAACGAC
TGGGATATTT ATGCAGCCAA GCTCGACAGC AATGGCAATT TGGTCTGGGC TAAACAGATC
GGCGGTAGCA TTGATGACTA TGTTAATAGC ATAACCACAG ACAGCAGCGG CAATGTCTTG
GTGGGGGGCA GTTTTCGGAG CAACATTGAT ATCGACGGCG ATGGGAACAA TGATTTTACC
TCTAACGGCT TCGGGGATGG TGGGGATGGT TATGTAGCCA AGTTCGACAG CAATGGCAAT
TTGGTCTGGG CTAAACAGAT CGGCGGTAGC TATTGGGACA ATGCTAATAG CATAGCCACA
GACAGCAGCG GCAATGTCTT GGTGGGGGGT TCTTTTGAGA GCTACATTGA CATCGACGGC
GATGGGAGCA TTGATTTGAT CCCTGATGGC TTCGGGGATG GTTATGTAGC CAAGTTCGAC
AGCAATGGCA ATTTGGTCTG GGCTAAACAG ATCGGCGGTA GCAATTGGGA CTCTCCTTAT
AGCATAACCA CAGACAGCAG TGGCAATGTT TATAGCATAA CCACAGACAG CAGTGGCAAT
GTCTTGGTGG GGGGTTCTTT TCGGAGCAAC ATTGACATCG ACGGCGATTG GAACAATGAT
TTGACCTCTA ATGGCGACCT GGATGGTTAT GTAGCCAAGT TCGACAGCAA TGGCAATTTG
GTCTGGGCTA AACAGCTCGG CGGTAGCAAT TGGGACAATG TTAATAGCAT AACCACAGAC
AGCAGCGGCA ATGTCTTGGT GGGGGGTTAT TTTGATGGCA ACATTGACAT CGACGACGAT
GGGAACAATG ATTTTACCTC TAATGGATTC ACGGATGGTT ATGTAGCCAA GTTCGACAGC
AATGGCAATT TGGTCTGGGC TAAACAGATC GGCGGTAGCA GTGATGACTA TGCTAATAGC
ATAGCCACAG ACAGCAGTGG CAATGTCTTC GTGGGGGGTA TTTTTTCCGC CAACATTGAC
ATCGACGGCG ATAGAAACAA TGATTTGACC TCTAATGGAT TCACGGATGG TTATGTAGCC
AAGTTCGACA GCAATGGCAA TTTGGTCTGG GCTAAACAGA TCGGCGGTAG CAGTTTGGAC
TATGCTAATA GCATAACCAC AGACAGCAGC GACAATGTCT TCGTGGGGGG TTCTTTTTAT
GGCAACATTG ACATCGACGG CGATGGGAAC AATGATTTTA CCTCTAATGG CTTCGGGGAT
GGTTTTGTCA TAAAATTATC GGAACAAACT AGCTCCCCCC AAACTAGCCC CCCTCCCACC
GACACTGAAC CAACCCGGTT TGACTTCAAT GCCGATGGAG TCGCAGACAT TTTCTGGCGT
CACCCAAATG GAGCTAACAG AATTTGGTTG ATGAACGATG AGGGCACACG GGATAGTACA
GTTGACCCCG GAAAGTTTGG CAAAGCTTGG GATGTCGCTG GAGTCGCAGA TTTCAATACT
GACGGAGTCG CAGACATTTT TTGGCGTCAC CCAAATGGAG CTAACAGAAT TTGGTTGATG
AACGATGAGG GCACACGGGA TAGTACAGTT AACCCCGGAA AGTTTGGCAA GGCTTGGGAT
GTCGCTGGAG TCGCAGATTT CAATACTGAC GGAGTCGCAG ACATTTTCTG GCATCACCCA
AATGGAGCTA ACAGAATTTG GTTGATGAAC GATGAGGGCA CACGGGATAG TACAGTTAAC
CCCGGAAAGT TTGGTAAAGC TTGGGATGTC GCTGGAGTTG CAGATTTCAA TACTGACGGA
GTCGCAGACA TTTTCTGGCA TCACCCAAAT GGAGCTAACA GAATTTGGTT GATGAACGAT
GAGGGCACAC GGGATAGTAC AGTTAGCCCC GGAAAGTTTG GTAAAGCTTG GGATGTAGCG
GGGGTTGCAG ATTTCAATAC TGACGGAGTC GCAGACATTT TCTGGCGTCA CCCAAATGGA
GCTAACAGAA TTTGGTTGAT GAACGATGAG GGCACACGGG ATAGTAGACT TAACCCCGGA
AGCTTCAGGT CAGCTTGGGA TGTAGCGGGA GTCGCAGATT TCAATACTGA CGGAGTCGCA
GACATTTTCT GGCATCACCC AAATGGAGCT AACAGAATTT GGTTGATGAA CGATGAGGGC
ACAAAGGATA GTGGACTTAA CCCCGCAAGG TTCAGTTCAA CTTGGGATGT AGTTGGGATG
TAA
 
Protein sequence
MKLNTKSIVI IDASVENYQQ LLKGVVTGVK PFLLGGDTDG IQQIGDILQK NPETDTLHII 
SHGSPGCLYL GNSQLSLDTL KGYESQLQQW QLDNLLLYGC NVAAGDGGEE FIDKLHRLTG
AEIAASKSLT GAAVKGGNWE LEVRTGKSKL SLALQVETMA SYSDTLNLQF EWAKQIGASS
SGDVSSITTD SNGNVLVGGL FQGNIDIDGD GNNDLTSNND WDIYAAKLDS NGNLVWAKQI
GGSIDDYVNS ITTDSSGNVL VGGSFRSNID IDGDGNNDFT SNGFGDGGDG YVAKFDSNGN
LVWAKQIGGS YWDNANSIAT DSSGNVLVGG SFESYIDIDG DGSIDLIPDG FGDGYVAKFD
SNGNLVWAKQ IGGSNWDSPY SITTDSSGNV YSITTDSSGN VLVGGSFRSN IDIDGDWNND
LTSNGDLDGY VAKFDSNGNL VWAKQLGGSN WDNVNSITTD SSGNVLVGGY FDGNIDIDDD
GNNDFTSNGF TDGYVAKFDS NGNLVWAKQI GGSSDDYANS IATDSSGNVF VGGIFSANID
IDGDRNNDLT SNGFTDGYVA KFDSNGNLVW AKQIGGSSLD YANSITTDSS DNVFVGGSFY
GNIDIDGDGN NDFTSNGFGD GFVIKLSEQT SSPQTSPPPT DTEPTRFDFN ADGVADIFWR
HPNGANRIWL MNDEGTRDST VDPGKFGKAW DVAGVADFNT DGVADIFWRH PNGANRIWLM
NDEGTRDSTV NPGKFGKAWD VAGVADFNTD GVADIFWHHP NGANRIWLMN DEGTRDSTVN
PGKFGKAWDV AGVADFNTDG VADIFWHHPN GANRIWLMND EGTRDSTVSP GKFGKAWDVA
GVADFNTDGV ADIFWRHPNG ANRIWLMNDE GTRDSRLNPG SFRSAWDVAG VADFNTDGVA
DIFWHHPNGA NRIWLMNDEG TKDSGLNPAR FSSTWDVVGM