Gene Tery_2114 details


Gene Information

Locus tag: Tery_2114
Symbol:
ID: 4243950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3302074
End bp: 3303507
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 33%
IMG OID: 638107221
Product: WD repeat-containing protein
Protein accession: YP_721822
Protein GI: 113475761
COG category:
COG ID:
TIGRFAM ID:
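
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3302074, 3303507)
print(length, protein_length(length))  # prints: 1434 477
```

Both values agree with the Gene Length and Protein Length fields of this record.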


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.485038
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.942485
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATGATT TGTTTGCTAT AATTTCCCAA AACACTAACA TTTTCCAATC TATGGATATT 
AAAGAAGTTT TGAAGTTGGT TGAGGAGCTA ATTTATGACC ATACCGGAGA AAATTTAGAC
TATCTGCCAA AAACTATATT GCAAGGCACT TTAGAAGGTC AGACTTACAA AAAAATTTCT
GAGGAAACTT ACGCTAGCGA AACCCATGTT AGGTATATTG GTGCTAAATT ATGGAAAACT
TTATCGGAAA TACTCGGTGA AAATATTACC AAAGCAAATT TTAGAACAAT TTTAGAAAAT
ACAAATATTT ACAATTTTGT ACCTGTTGTT TGTAGAGATA ATGCAACAAA TAACATTAAT
ATCTGTCCGT CAAACCCTCA CGCTTCCGCA ATAAAAACCA ACTCTGAACA AACACCAACT
CAACATATTA TTGACCTAGA CAAAGCGCCA GAAATTATCA ACTTTTATGG GCGCACAGAG
GAAATATCAA CCATGACAAA ATGGATAGTG AGCGATCGCC TCCGTGTCAT TTCCCTTTTA
GGAATAAGTG GTATCGGCAA AACGACCCTT TCTCTAAGAT TAATTCAACA AATAAATGGG
TTAGAAAACC AAACCCCTAC CTTTAACTAC ATCATTTATC GGACCCTACG CTTTTCTCCC
ACCCTCACCA CAACCCTCAC TAACCTCCTG CAAATTTTCG CACCAGAAAC AGAAGTTTCC
CACAATATCG ACATCCTATT ATCACAACTA CAAAAATACC TAGCAAAATA TTCTTGCTGG
ATCATATTTG ATGACGTACA CAAATTATTT ACTCCGGGAA AACTTGCTGG TCAATACAAA
TCTGGTTATG CAAATTATCG GGATTTTTTT CAACTATTTG CCACAGTCTC CCATCAGAGT
TGTCTACTAT TAATAAGTAG AGAAAAAATA GCAGAAATTT TCAAATTAGA AACAGAAAAT
TATCCAGTGG AAACTTTAAT TTTAGGAAGT TTAGGAGCTT CGTCTAAAGC AATCTGTCAA
AGGCACAAGT TAGTAAATCA AGAATCCTGG GAAAAATTGA TTAATAATTA TCAAGGAAAT
CCGCAATGGT TAGATATAAC AGCAACAATA ATTCAAGAAT TATTTAGAGG TAAAGTAGCA
GAATTTATGG AATATAAAAC CCTAATTTTA CCAGAAGCAT TACAAGCCGA ATTAGAACAA
CAATTACAAA ATTTATCCCC TCTTGAAATC AGAATGATGA AGCAAATAGC CAACCAAAGC
CAACCTATTT CAATAGCGGA AATAAGTAGA CAATCACAAC TATCTATTCA GGAAAATATT
AACTTAATTC AATCCCTAAA GAAGCGGTTA TTATTATATG GGCAGTTAGA GAATAATTTG
ACAGTATTTA CTCTGAATTC AATCTGGAAT CAATACTTAA AAAATAAAAA TTAG
 
Protein sequence
MYDLFAIISQ NTNIFQSMDI KEVLKLVEEL IYDHTGENLD YLPKTILQGT LEGQTYKKIS 
EETYASETHV RYIGAKLWKT LSEILGENIT KANFRTILEN TNIYNFVPVV CRDNATNNIN
ICPSNPHASA IKTNSEQTPT QHIIDLDKAP EIINFYGRTE EISTMTKWIV SDRLRVISLL
GISGIGKTTL SLRLIQQING LENQTPTFNY IIYRTLRFSP TLTTTLTNLL QIFAPETEVS
HNIDILLSQL QKYLAKYSCW IIFDDVHKLF TPGKLAGQYK SGYANYRDFF QLFATVSHQS
CLLLISREKI AEIFKLETEN YPVETLILGS LGASSKAICQ RHKLVNQESW EKLINNYQGN
PQWLDITATI IQELFRGKVA EFMEYKTLIL PEALQAELEQ QLQNLSPLEI RMMKQIANQS
QPISIAEISR QSQLSIQENI NLIQSLKKRL LLYGQLENNL TVFTLNSIWN QYLKNKN
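
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming the listing has been copied as plain text; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the gene sequence is used as sample input, so the printed GC fraction is for that row and not the 33% reported for the whole gene.

```python
def clean_sequence(block_text: str) -> str:
    """Join a space- and newline-separated sequence listing into one string."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence listing, used only as sample input.
first_row = "ATGTATGATT TGTTTGCTAT AATTTCCCAA AACACTAACA TTTTCCAATC TATGGATATT"
dna = clean_sequence(first_row)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence listing to `clean_sequence` in the same way should yield a string whose length matches the Gene Length field.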