Gene Acid345_2127 details

Gene Information

Locus tag: Acid345_2127
Symbol:
ID: 4072369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2541774
End bp: 2543849
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 58%
IMG OID: 637984142
Product: phosphoesterase
Protein accession: YP_591202
Protein GI: 94969154
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3511] Phospholipase C
TIGRFAM ID:
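
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2541774
end_bp = 2543849

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2076
print(protein_length)  # 691
```

Both results agree with the Gene Length (2076 bp) and Protein Length (691 aa) fields.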


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCTCA AGCAGATCGC ATCACGAGCT CTGCGCAGCA TGCGCACTGG ACTTGTAACG 
CTCGCAATGT TCCAGTTCGC AGTCGGCTCA AGCTTCGCAG GCGTCGTCGA AGACAATCGT
TCGACGAAAG ATGCGGATAC GCGCACCCCC ATCAAGCATG TGATCGTCAT CATCGGTGAA
AACCGTACCT TCGATCACGT CTTTGCCACC TACATCCCCA AACGCGGCGA GAGTGTTTGG
AACCTGCTCT CTCAAGGCAT CATCAACGCC GACGGAACAC CCGGACCGAA GTTCAAACTC
GCTGAGCAAA AGGCAGCCAG TGACGCTGCT CCCGATGCCT TTCTTCTGAG CCCACCCAAA
ACCTCGTTCC CCGGAAATGT AATGCCCGCT CCCCTCGTGG GAGGCGCAAA AGACTCTCCG
ATCCCGGGCG ACAGCCTCAC GCTGGCGCAG CAGTCGGAGA ACGGACTGCC CTCCGACTAT
TACCAGTACC TAATCTCCGG CGGAACGGGA CTGACATCGC GGACACCCGA TACGCGCATT
ACCAACGTGA ACAACCTTCC TCCAGGGCCG TTCCAGCTCA CCAACAGCAG CACATTGGAC
TACAACGCCT ACGCGGCCAG CCCAGTGCAC CGCTTCTACC AGATGTGGCA GCAGCTGAAT
TGCAGCGCGA AGCGTGCGAC AAAATCCAAT CCTTCGGGCT GTAACTCGAA ACTCTTCCCG
TGGGTGGAAA CGACGGTGGG AGCCGGCGCG AACGGCATTG CGCAGCCTTC CAACTTCAGC
ACCGACTATG CTCCCGGCGC GGTGACCACG GGCGAAGGCT CGACCGCGAT GGGCTTCTAC
AACATGCTGC AAGGTGATGC TCCTTACTTC AAGAAGCTCG CTGACCAGTA TGCGATGAGC
GACAACTTCC ATCAGTCCGT CCAGGGCGGC ACTGGTGCGA ACCACATTAT GCTTGGCCAC
GGCGACATGA TCTTCTTCAG CGATGGCAAG GGGCACGCTG CGAAACCGCC GCACAACCAG
GAAGTCGCGA GCGGCACTCC GAGTGCGGGG ATCGTAGATC AAATTGAGAA CCCGAATCCG
GCTCCGGGCA CCAACAACTG GTACACCGAA GATGGCTACG GCGGCGGTTC CTTCGGCTCG
CCATCGTACG GCGGTGGCTC GTATAGTGAC TGTGCGGACC TCAGCCAGCC GGGCGTAGCG
GAAGTCGTGA CCTATCTTCA TTCGCTTAAG CGTCCGGTGA AGGCAAATTG CGAGAAGGGC
CACTATTACC TGCTGAACAA CTACAACCCG GGCTACTTCG GCAACGGCAA CAATGCCTAC
ACCGACACCA ACGCGAACAA CACGGTGTTC ACGATCCCGC CGTCCGAGGT GCGCAGCATC
GGCGACGAGA TGAGCGACAA GAAGGTCTCG TGGAAGTATT ACGGCGACCA GTGGAACAAC
TATGTGCCCG ATCCTTACCA GTTGAATTAC GGCGCGATCG GCACGAACAC CGACGAATAC
TGCAACATCT GCAACCCGTT CCAGTACGAC ACCTCGATCA TGGCCGATGC GAAGGTCCGC
AACGCACACA TGAAGGACAC CACGGACCTT TACCACGACA TTCAGAACAA CTCGCTTCCG
GCCGTGTCGT TCGTAAAGCC GAGCGGCCTG GTAGATGGCC ATCCCGCTTC GTCGAAACTG
AACTTGTTTG AAGGCTTCAC CAAGAAAATT GTGGACGCAG TGAAGTCGAA CCCCGAGGTT
TGGAAGGACA CCGCAATCTT CATCACCTTC GATGAAGGCG GCGGCTACTA CGACTCCGGC
TTCATCCAGC CGCTCGATTA TTTTGGCGAC GGCACCCGCA TCCCAATGCT CGTGGTCTCG
AAGTATTCGA CCGGCGGCCA CATCGCTCAC GACTACGCCG ACCACGTCTC GATCCTCAAG
TTCATCGAGC GCAACTGGAA GCTGCAGCCA GTGACGAAGC GGAGCCGCGA CAACTTCCCC
AACCCGAAAG CTGAGTGGGA CAACCCGTAC GTTCCCGTCA ACGGTCCTGC GATCAGCGAT
ATGTTTGAGC TGTTCGACTT CGGCCGCGGA CGATGA
 
Protein sequence
MSLKQIASRA LRSMRTGLVT LAMFQFAVGS SFAGVVEDNR STKDADTRTP IKHVIVIIGE 
NRTFDHVFAT YIPKRGESVW NLLSQGIINA DGTPGPKFKL AEQKAASDAA PDAFLLSPPK
TSFPGNVMPA PLVGGAKDSP IPGDSLTLAQ QSENGLPSDY YQYLISGGTG LTSRTPDTRI
TNVNNLPPGP FQLTNSSTLD YNAYAASPVH RFYQMWQQLN CSAKRATKSN PSGCNSKLFP
WVETTVGAGA NGIAQPSNFS TDYAPGAVTT GEGSTAMGFY NMLQGDAPYF KKLADQYAMS
DNFHQSVQGG TGANHIMLGH GDMIFFSDGK GHAAKPPHNQ EVASGTPSAG IVDQIENPNP
APGTNNWYTE DGYGGGSFGS PSYGGGSYSD CADLSQPGVA EVVTYLHSLK RPVKANCEKG
HYYLLNNYNP GYFGNGNNAY TDTNANNTVF TIPPSEVRSI GDEMSDKKVS WKYYGDQWNN
YVPDPYQLNY GAIGTNTDEY CNICNPFQYD TSIMADAKVR NAHMKDTTDL YHDIQNNSLP
AVSFVKPSGL VDGHPASSKL NLFEGFTKKI VDAVKSNPEV WKDTAIFITF DEGGGYYDSG
FIQPLDYFGD GTRIPMLVVS KYSTGGHIAH DYADHVSILK FIERNWKLQP VTKRSRDNFP
NPKAEWDNPY VPVNGPAISD MFELFDFGRG R
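
The gene and protein sequences above are printed in blocks of 10 characters, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the coding sequence. It is an illustration only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built by hand from the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
fragment = "ATGTCTCTCA AGCAGATCGC ATCACGAGCT"
print(translate(fragment))    # MSLKQIASRA
print(gc_fraction(fragment))  # 0.5
```

The translated fragment matches the first 10 residues of the protein sequence. Passing the full gene sequence to the same functions should allow a comparison against the 691 aa protein and the reported GC content.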