Gene Tery_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1870 
Symbol 
ID4245491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2857816 
End bp2859453 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content35% 
IMG OID638106991 
Producthelix-hairpin-helix repeat-containing competence protein ComEA 
Protein accessionYP_721599 
Protein GI113475538 
COG category[I] Lipid transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes
[COG1555] DNA uptake protein and related DNA-binding proteins 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.791569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.192148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTACTTT ATTCTTTGAA ACTGCGATTT AGCTATGTTT TACTATTAAC TTTGAACCTA 
ACTGCTTGTC TGAATAAACC TCAAGAGTTA GTTCCTCCTC TATCCCCATT GCCACAAGAA
CCTCTAATTA AATGTTACTT TAATCAATCT CAAGCTTCAA GCTATACTGA AGTCCTACGC
CAGCAAACCC GACTAGGAGA TAATCTTGAA GAAATAATTG TTAATACTAT TGCTGATGCT
AATTCTACTG TGTATGTGGC GGTTCAAGAG TTAAGTTTAC CAAGTATCAC TCAGGCTTTG
GCCGAAAGGT ATCGAGCTGG GGTCAAAGTA CGAGTTATTA TTGAAAATAC CTATAATCTT
CCCTGGGAAA GTTTAAGTCC ATCTGAAGTC TCTCAACTGC CTGAACGGGA GCGCGATCGC
TACAATGAAT TTATTAGTAT CTCTAGCAAT AACATAGATG GTGACATTAG TCAATATAAA
GTGAATCATA GGGATGCTCT AATAATTTTA AGCAATGCAG GAGTTCCCTT GATCGATGAT
ACTGCTGATG GCTCTAAGGG TAGTGGTTTG ATGCACCATA AATTTTTAGT TGTGGATGGT
AAAACCACCA TTGTCACCTC GGCTAATTTT ACTACCAGTG GTATTCATGG GGATTTTTCG
GAGTCTCTGA GTCAAGGAAA TACTAATAAT CTCTTAAAAA TAGAAAGTGT TGAATTAGCT
GAACTATTCA CAGAAGAATT TAATCTCTTA TGGGGAGACG GACCTGGAGG TAAACTAGAT
AGTAAATTTG GCATCAAAAA ACCATTTAGA CAGACAAGAG AAGTATTGGT TGGTGATACA
AAAGTTGCAG TACAATTTTC TCCTACTTCT AAGAGTCTAT CTTGGCAAAA AAGTGTAAAT
GGTCTAATTA ATCAAACATT AGAAAAAGCA CAAAAATCTA TTAATTTATC CCTATTTATA
TTTTCAGCAC AACTATTAGT TAATACCCTA GAAAAAAAAT CTCTACAAGG AGTATTAATT
CAAGGTTTAA TTGACCGTAG TTTTGCCTAT CGTTACTATA GTGAAGGGTT AGATATAATG
GGAATTGCTC TGCCAAATAA ATGTAAATAT GAAACTAATA ATCAGACTTG GAAAAAACCT
ATATATACAG TAGGTGTACC TAACTTACCC CCTGGCGATC GCCTCCATCA CAAATTTGGA
ATTATTGATA ATTCAATTGT AATTACTGGT TCTCACAACT GGACTGAAGC AGCTAATAAA
AACAATGATG AAACTTTATT AGTAATAGAA AGTTCCACAG TAGCAGCTCA TTTTGAACGT
GAGTTTCAAA GACTTTATCA AACAGCAACT ATGGGTATAC CATCATGGTT GATAAAAAAA
GTTGAAAAAC AACAACAAGA ATGTGGTGAT AGATTAACAA AAACTGAAAT TTTCCTAACA
AATACTAATA ATAAAATTAA TCTCAATACT GCTACTGCTG AAGAATTAGA AACACTTCCT
GGAGTCGGAC CAAAATTAGC AGAACGCATT ATTCAAGCTA GGAAAAATAA ACCTTTTACA
TCTTTAGCAG ATCTAGATCA GATATCTGGT ATTGGACCTA AAATATTAGA TAAATTAAGC
GATCGGGTAA CTTGGTAA
 
Protein sequence
MLLYSLKLRF SYVLLLTLNL TACLNKPQEL VPPLSPLPQE PLIKCYFNQS QASSYTEVLR 
QQTRLGDNLE EIIVNTIADA NSTVYVAVQE LSLPSITQAL AERYRAGVKV RVIIENTYNL
PWESLSPSEV SQLPERERDR YNEFISISSN NIDGDISQYK VNHRDALIIL SNAGVPLIDD
TADGSKGSGL MHHKFLVVDG KTTIVTSANF TTSGIHGDFS ESLSQGNTNN LLKIESVELA
ELFTEEFNLL WGDGPGGKLD SKFGIKKPFR QTREVLVGDT KVAVQFSPTS KSLSWQKSVN
GLINQTLEKA QKSINLSLFI FSAQLLVNTL EKKSLQGVLI QGLIDRSFAY RYYSEGLDIM
GIALPNKCKY ETNNQTWKKP IYTVGVPNLP PGDRLHHKFG IIDNSIVITG SHNWTEAANK
NNDETLLVIE SSTVAAHFER EFQRLYQTAT MGIPSWLIKK VEKQQQECGD RLTKTEIFLT
NTNNKINLNT ATAEELETLP GVGPKLAERI IQARKNKPFT SLADLDQISG IGPKILDKLS
DRVTW