Gene Acid345_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0140 
Symbol 
ID4069725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp148330 
End bp150936 
Gene Length2607 bp 
Protein Length868 aa 
Translation table11 
GC content62% 
IMG OID637982140 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_589219 
Protein GI94967171 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein
[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0038467 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACTCCC CCGGGCCCAC TGACGTTCCG CGCTCTCCTT TCTGCGGCGG CAAACAACCG 
CTGTTTCTTA CGGCGTTGGC GTTTGGCTGC GGCATCCTTG CCGCGCGATT TGTCTTTCAC
CCCTCAAATG CCTGGCTGAT CGCTGCGCTG CTGGTGGTGG CCTGCGCGAT TGGGTTGCGC
AAGTCGCCGA CGGTCGCGTT TGCGGGGATG ATGCTCTCGG CCGCGATGGC TGGTGGGCTG
GCGCATGAGC TGCGCGGGCC AGAGGGCGTG GACTATCCGC AGGCGATGAG CGATGGAGAG
TGCACCGTTA CGGCACACGT GGTGCGCGAT GGCGTGTTGC AGCGCGGGAT GTTCGGCGGG
CAGCAGCAGT CGGTGGACCT GGAGACGGAG CGAGTGCAAC TGGCCGATGG AAGTGGGTTC
GGGAGGCCGG TGGGATTGCG GCTGTCGGTG TACGCGCGGA AATCCGATTA CGAGGATGAA
GAGCAGCAGC AGGCAGGAGT GCTGCCGATG CCGGTGCTGC GCTATGGGCA GCGGGTACGG
CTGACGGCGA AGCTGCATGA GGCGCGGAAC TACAAGAACC CGGGAGCGTG GGACTATCGC
GGATATTTGC GGGCGCAGGG CATCGAGTTG CTGGGAAATG CGCGGGTGAG TTCGGTTGAA
GTGCTGCCGG GTTTTGGCGG AAGCCGGTGG AAGCAGTCGC AGCATCGGGC ACGGGCGGCG
GTGCTGGCGA AGATCCACGA GATCTGGCCG GCGGAACAAG CAGCGCTGTT CGATGCGATC
GTGATTGGCG AGCGCTCCGA GCTGGGCAGT GAATTGAAGA CCAGTTTCCA GAGTACGGGG
ACGTTTCACA TTCTCGTGGT TTCAGGGATG AACGTCGGGA TTTTGGCGTT TGGATTTTTC
TGGCTGTTCC GACGCATCCG CATGGGAGAT GTGGTGGCGA CGATCTGCAC GCTGGCGTCG
TCGTTTGCGT ATGCGTGGCT GACAGATTTG GGGTCGCCGA TTTTGCGGGC GGTGTGGACC
CTGACGATTT ATCTGCTTGC GCGCTTGTTG TTCCGGCAGA GTTCGCGGCT GAATGCGATT
GGCGTGGCCG CGCTGGTGAT CCTGGCTTGG TCGCCGGATG CGTTGTTCGA CGCGAGCTTC
CAACTGACGT TCTTGTCGGT GGCGGTGATC GCGGGCGTGG TGGTGCCGTG GATCGAGAGG
ACCTCCGATC CTTATCGCAA GGCGCTGCAA AACCTGAATA TCGTGCGCGC GGACCGCGGG
TATCGGCCAC AGCAGCAGCA ATTCCGGCTG GATCTGCGGA TGATCGGGGG ACGGCTTGCG
CGAGTCCTCC CACACCGGAT CGTGCGGCCA ATGTTGACGA CACCATTCCG CGTAACGTTT
GCGGCGTATG AGCTGCTGCT GGTTTCGTTG CTAATGGAGT TCACGCTGGC GCTTCCGATG
GCGGTGTACT TCCATCGCGT GACGCTGATG GCGCCGGTGG CGAATGCATT GGTGGTGCCC
CTGACCGGTG TGCTGATGCC GGCGTGCGCG GCGGCAGTGT TGCTGGGATT TGTGTGGTTG
CGACTGGCGA GATTGCCGGC GATGGTGGCG CTGTGGTCGC TGAAGGCGAT TACGGGTGCG
GTGGCGGTGC TGGGGCATTT GCGGGTGTCG ACGGGGAGAG TGGCGACGCC ATCGCTCGTC
GTTGCGTTGA GCGCTGCCGT GGCGATTGCG CTGGCGATGT TGCTGGCGCG GCGGAGGTGT
TGGCTGGCAG CGGTCGGGAG CCTGGCGATC ATTGCTTCTG GTGCGGCGGT TCTCTTCATC
GTTCCGCAAC CGCAGCATAA GTCCGGCGCG CTGGAAGTGA CGGCCATCGA TGTGGGGCAG
GGTGATTCGC TGTTGCTGGT GTATCCGAAT GGGCAGTCGA TGTTGCTCGA TTCGGGTGGA
CCGCTGGGTG GGTCGCATTC CGACTTTGAC GTGGGCGAGC AGGTGACTTC GCCGTATCTG
TGGGCGCGTG GAATCAACCG ACTCGATGTT GTCGCTTACA GTCATCCGCA TTCGGACCAC
ATGGGATCGA TGGCGACCAT CATTCGCAAC TTCCGGCCGC GGGAGTTGTG GCTGGGATTT
GCGCCGCCGG TGAGGGATGT GGAGAACGTA TTGCAAGCCG CGCGGGAAGA GCATGTGACG
GTTCGGTTCT TTCGCACGGG AGATCAGTTT GGTTTCGGTG GGGCGAATGT ACGGGTGCTG
CTGCCTCTGC GAGACCAGGA GCCGCACATG CCGCCGAAGG ATGATGATGT TCTTGTGCTG
AAGATTGCAT ATGGGAAGAC TTCAGCGCTG CTGATTGGGG ATTCGCATAA AAAGGAAGAG
CGGGAACTGA TCGACCTCGC GCCAGAGGCG GATCTGTTGA AGGTGGCGCA TCACGGGAGC
AATACGTCGA GTTCGCCGGA GTTCCTGGCT GCGGTGCATC CGAAATTTGG GGTGATTAGC
GTGGGGGCGA GGAACTCGTT CAAGCATCCG CGGCCGGAGG TGCTGGAGCG GCTGGCGTCG
TTTGGGGTGC AGACCTATCG GACGGATATG GCGGGGGCTA CGACGTTTTA TCTGGATGGA
ACGAACGTGA GCGTCGTGCG GAGATGA
 
Protein sequence
MNSPGPTDVP RSPFCGGKQP LFLTALAFGC GILAARFVFH PSNAWLIAAL LVVACAIGLR 
KSPTVAFAGM MLSAAMAGGL AHELRGPEGV DYPQAMSDGE CTVTAHVVRD GVLQRGMFGG
QQQSVDLETE RVQLADGSGF GRPVGLRLSV YARKSDYEDE EQQQAGVLPM PVLRYGQRVR
LTAKLHEARN YKNPGAWDYR GYLRAQGIEL LGNARVSSVE VLPGFGGSRW KQSQHRARAA
VLAKIHEIWP AEQAALFDAI VIGERSELGS ELKTSFQSTG TFHILVVSGM NVGILAFGFF
WLFRRIRMGD VVATICTLAS SFAYAWLTDL GSPILRAVWT LTIYLLARLL FRQSSRLNAI
GVAALVILAW SPDALFDASF QLTFLSVAVI AGVVVPWIER TSDPYRKALQ NLNIVRADRG
YRPQQQQFRL DLRMIGGRLA RVLPHRIVRP MLTTPFRVTF AAYELLLVSL LMEFTLALPM
AVYFHRVTLM APVANALVVP LTGVLMPACA AAVLLGFVWL RLARLPAMVA LWSLKAITGA
VAVLGHLRVS TGRVATPSLV VALSAAVAIA LAMLLARRRC WLAAVGSLAI IASGAAVLFI
VPQPQHKSGA LEVTAIDVGQ GDSLLLVYPN GQSMLLDSGG PLGGSHSDFD VGEQVTSPYL
WARGINRLDV VAYSHPHSDH MGSMATIIRN FRPRELWLGF APPVRDVENV LQAAREEHVT
VRFFRTGDQF GFGGANVRVL LPLRDQEPHM PPKDDDVLVL KIAYGKTSAL LIGDSHKKEE
RELIDLAPEA DLLKVAHHGS NTSSSPEFLA AVHPKFGVIS VGARNSFKHP RPEVLERLAS
FGVQTYRTDM AGATTFYLDG TNVSVVRR