Gene Acid345_4181 details

Gene Information

Locus tag: Acid345_4181
Symbol:
ID: 4072140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4947039
End bp: 4948607
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 57%
IMG OID: 637986212
Product: phospholipase C
Protein accession: YP_593255
Protein GI: 94971207
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3511] Phospholipase C
TIGRFAM ID:
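
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not the database's own method: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(4947039, 4948607)
print(length, protein_length_aa(length))  # 1569 522
```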


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.424185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.508071
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTTC CCGGTCGGCT CCGTAGCTGT TTCCAGGCCG CTTGTTTCCT CTCGATTCTC 
CTGGGCTCTG CCAATCTCTT CGCGTTGTGC ACGCTCAACA CGCAGAACCA GACCGTCACC
ATCTGCACAC CAGCTCCGAA CGCCACCGTC TCTTCTCCTG TGAACGTCCA GGCGGGAGTT
ACCGATAGCA ACGCCGTCAA GGCGCTCCAG ATCTACGTGG ATGGCGTGAA GGTCTATGAA
ATTGTCGCCA AGACGCTGAA CACCAACGTC ACGATGGCGA ATGGCGCGCA CCGGCTCACC
GTCCAGGCGC AGGATTCCAC CGGCGCAGTC TTTAAATCCA CCGAGAACAT CAATGTCTCG
ACCGCGGGAG CCGGCACCAT CAACGACGTG AAGCACATCA TCTTCATGGT CGAAGAGAAC
CGCTCCTTCG ATAGCTACTT CGGCATGATG GGCGCTTACC GCACGAAGCT AGGTTACGGC
GGCACCTTTA ATGGCGTGCC GTTGAACGCG TCTCTATCCG ATTACAAAGG CACAGGCAAC
GTGAGTCCGT TCCATTACCA GACGGTATGC ACGGACAACA TGACTCCAGC CTGGAACGAG
AGCCACTACT CGTGGCATGC CGGCAAGATG GACTACTTCA TGAAAGTGGA AGGCTCACTG
CCTTCGTCCA TTGATCCCCA GGGCACGCGC ATCATGGGCT ACTACGACCA GACCGACTTG
CCGTATTACT ACGAACTCGC AACGCAGTAT GCCACCAGCG ATACCTGGCA TACGCCGATT
TTGTCTGACA CCATCCCGAA CCGCATGTAC CTCTTCACGG CGACTTCCTT CGGACACATT
CGCCCGCAAG ATGTGCCGCC CAGCGGCGGA TGGACGCAGC CGACGATCTT CCGGGACCTC
TCGCAGCACG GAATTACCTG GCGTTATTAC TACCAGGACA ATTCCGTGTA TCTCGCGAGC
TTCTCCGACT GGAACGCATA TCAGAACAAC GTCTACAACA TCAGCCACTA TTACACCGAC
ATCCAGAACC CGAGCACGCT ACCAGAGGTG ATCTTCATCG AACGCGGCAG CCAGACGGGC
GTTGACGAGC ATCCGCTCAA CAACATCCAG AAGGGTGCAG CCGATGTCGC CAAGATCATC
AACTCGTTCC TGACGAGCCC GAGCTACTCG AGTTCGGTGT TCATCCTGAC CTACGACGAT
CCCGGCGGTC TCTACGATCA CGTGCCGCCA TTCTCCGAAC CCGCACCCGA CAGTATCCCG
CCGATGGTGC GGTCTACGGA CATCAAGGGC GACTTCTTGG AGTCCGGTTT CCGCGTACCG
TTGATCGTGG TTTCCCCATG GACAAAGCCT CATTACGTGT CGCATGTGAA CCGCGACTAC
ACTGCGATGC TCAAGTTCAT CGAGAAACGT TTCGGACTGC CGGCGCTTAC GAAGCGCGAC
GCCGCGCAGG ACGATATGAC CGAGATGTTC AACTTTGCAA CGCCACAGAT CCCAACGCCT
CCCGCTATGC CGACGCAGCC AACCAGCGGT GTCTGCAATA AGAACCTGGA GAAGGCGCCA
GGATACTAG
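
A GC content figure such as the one reported in the Gene Information section can be recomputed from a sequence printed in this 10-base block layout. The sketch below strips the whitespace and counts G and C bases; the helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence block in place of this first row.
first_row = "ATGCATTTTC CCGGTCGGCT CCGTAGCTGT TTCCAGGCCG CTTGTTTCCT CTCGATTCTC"
print(round(gc_percent(clean_sequence(first_row)), 1))
```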
 
Protein sequence
MHFPGRLRSC FQAACFLSIL LGSANLFALC TLNTQNQTVT ICTPAPNATV SSPVNVQAGV 
TDSNAVKALQ IYVDGVKVYE IVAKTLNTNV TMANGAHRLT VQAQDSTGAV FKSTENINVS
TAGAGTINDV KHIIFMVEEN RSFDSYFGMM GAYRTKLGYG GTFNGVPLNA SLSDYKGTGN
VSPFHYQTVC TDNMTPAWNE SHYSWHAGKM DYFMKVEGSL PSSIDPQGTR IMGYYDQTDL
PYYYELATQY ATSDTWHTPI LSDTIPNRMY LFTATSFGHI RPQDVPPSGG WTQPTIFRDL
SQHGITWRYY YQDNSVYLAS FSDWNAYQNN VYNISHYYTD IQNPSTLPEV IFIERGSQTG
VDEHPLNNIQ KGAADVAKII NSFLTSPSYS SSVFILTYDD PGGLYDHVPP FSEPAPDSIP
PMVRSTDIKG DFLESGFRVP LIVVSPWTKP HYVSHVNRDY TAMLKFIEKR FGLPALTKRD
AAQDDMTEMF NFATPQIPTP PAMPTQPTSG VCNKNLEKAP GY
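
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that mapping without external libraries. It relies on table 11 assigning the same amino acids to codons as the standard code (the tables differ in which codons may act as starts), and it assumes the reading frame begins at the first base; since this gene begins with ATG, alternative start codons are not handled. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# NCBI codon ordering: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a (possibly blocked) DNA listing up to the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# The first 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGCATTTTC CCGGTCGGCT CCGTAGCTGT"))  # MHFPGRLRSC
```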