Gene Tery_2766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2766 
Symbol 
ID4244799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4287744 
End bp4288979 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content42% 
IMG OID638107825 
Producthypothetical protein 
Protein accessionYP_722422 
Protein GI113476361 
COG category[S] Function unknown 
COG ID[COG3825] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.114767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTCG TTGAAGAAGT ACCTTTGTTG AAGGTCTTAC TGTCTCTTTT TTACAGTTTG 
CGCCAATATG GCTTGCCTTT GGGAGTTGAA GACTATATGT TAGTGCTGAG GGCATTGCAA
GGTGGGTTTG GTATAGGCGA TCGCGACTCT TTAGAACGAC TATGTTGTAC TTTGTGGACA
AAATCTGAAC AGGAGGCCCG CCTGTTACAT CAACTTCTAG GTCGGGCAAT AACCAATGCT
CCTTCTTCTG CTGAATTACC ACAACCTGTC GAAGATACAC CGAATCCCTC CCCAACTAGC
GCTTCAACTA GACCTGTATC AAAGGAGTCT GAGGAAGTTA TGGACTCTTC GACTTTACCA
TCTACAGCGA CGCCTGTTCG CAACATATCT GAGCAACTTG AGGAAAAAGC TCCATTAAAA
GAGCCAGAAA TACCTGTGTC AAAACCGAGT CCATTGACTG ATATTCCCCT AGAAATAGAT
GAACCAGAAC TGGTCATTCA AGCTATCCGA CATTATAATA GGTCTAATGA AATGATTTCT
GAATACCAAG ATCTAGCTGC TCAATACCTG CCAGTGACCC CTCGGCAGAT TAAGCAGAGT
TGGCGTTTTT TAAGTCGTTC AGTCCCCCAA GGTATATCGG ACAAGTTGAA TGTACCAGCT
ACGGTAGCCA AGATTTGTCA GCAATGCATC TTAATTGAAG CGGTGCTGAT GCCAAATTAT
GTAAATCGAG TTAAACTTGT GTTACTGGTT GATCAAGGTG GCTCAATGAT TCCTTTTCAT
CATTTATCCC GTCAATTAAT AGATAAAGCT CGACGGGGTG GAAATATTGA GCAGGCGAGT
GTTTATTATT TTTATAATTA TCCTGAAATA TATTTTTATA GTGACCCAAC TCGCCTCAAA
GCTCAACTAA TTACAGATAT TTTAGGAGCT ATCGATGAAA GAGCAGGAGT ACTTATGGTC
AGTGATGCTG GAGCTGCCAG GAGTAATTAT AACCCAGAGC GAATTGAGTG CACCCAGAGG
TTTATTGAAC AACTTCGGCA GTCAGTCCGT TATTATGCTT GGCTAAATCC TATGCCTAAT
GATAGTTGGC AAGGCACGAC TGCTGGGGAA ATTGCTCGGT TTGTGCCAAT GTTTGAGATG
AGTCCTCAAG GATTTAATGC TGCTATCAAT GCTTTGCGTG GTCGCTATGT GTATGGGAAA
GATTTTTATG AGTTGAGCCG GCAAAAATCA TTATGA
 
Protein sequence
MNVVEEVPLL KVLLSLFYSL RQYGLPLGVE DYMLVLRALQ GGFGIGDRDS LERLCCTLWT 
KSEQEARLLH QLLGRAITNA PSSAELPQPV EDTPNPSPTS ASTRPVSKES EEVMDSSTLP
STATPVRNIS EQLEEKAPLK EPEIPVSKPS PLTDIPLEID EPELVIQAIR HYNRSNEMIS
EYQDLAAQYL PVTPRQIKQS WRFLSRSVPQ GISDKLNVPA TVAKICQQCI LIEAVLMPNY
VNRVKLVLLV DQGGSMIPFH HLSRQLIDKA RRGGNIEQAS VYYFYNYPEI YFYSDPTRLK
AQLITDILGA IDERAGVLMV SDAGAARSNY NPERIECTQR FIEQLRQSVR YYAWLNPMPN
DSWQGTTAGE IARFVPMFEM SPQGFNAAIN ALRGRYVYGK DFYELSRQKS L