Gene Acid345_2426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2426 
Symbol 
ID4072860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2865831 
End bp2867123 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content61% 
IMG OID637984442 
ProductHipA-like protein 
Protein accessionYP_591501 
Protein GI94969453 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.833222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGACG TATCCGTTAT CGAGGCCCGC GCGTGGGGGC GCACCGTGGG AGCAGTCACG 
CTCGATCCGA CTCTCGCGTA TTACGCTTTC GAATACGATC CGGCGTGGAA GCGGACTGGC
GTCGAGCTTG CTCCCTTGCA CATGCCGTTG AAGGGAAGCG GGCCGATCTA CATTTTTCCG
ACGCTACCGG AAGCCACCTA CTTTCGCTTG CCCGCGCTGT TGGCGGACGC TCTTCCTGAT
GATTTTGGCA ATGCTCTCAT CGATGCCTGG ATGGCCCAGC GCGGTGTCGA GAAGAGCGCG
ATCACAGTGC TCGATCGCCT GGCGTACATG GGGAAGCGCG GAGTGGGCGC GCTGGAGTTT
CGCCCCGCGA GAGGATCGCA TAACGAGAGT TCCGCGCCGA TCGAAATGAA AGAGCTGGTG
GAAGAGGCGC GGCGCCTGGT GCAGGGAACC TTCGCGGTCG ATCATGAAGC GCAGGCGGCG
CTGGCCAATA TCATCAAGGT GGGAACGTCG GCGGGCGGGG CGCGAGCGAA GGCCGTCATT
GCCTGGAACC CGGCAACCGA CGAAGTGCGC AGCGGACAAT TCGATGCCGC TCCCGGCTTC
GAACATTGGC TGCTGAAATT CGATGGCATA GGCAAAGATA AAGAGCTCGG GACCGGCGAA
GGCTATGGCC GCATCGAATT TGCGTACTAC CTGATGGCGA CCGGTGCGGG CATCGAGATG
GCGGTGTCGC GGCTGCTCGA GGAAAATGGC CGCGCGCATT TCATGACCCG GCGGTTCGAT
CGCGAAGTGG TCAACGGCAA GACGCGGAAG CACCACATTC AAACCCTGTG CGCCATGAAT
CATCTCGATT TCCGCCAGCG CGGAACTCAC GATTATGCGC AGCTCTTTAT GACCGCCAGC
GCGCTGGGGC TCGACGATGG CGCACTGGAC CAGATATTCC GTCGTATGGC ATTCAACGTC
ATGGCGCGCA ACTGCGACGA TCACACCAAG AACTTCAGTT TCATCCTTCG CGAAGGCCAA
CCGTGGCGCC TAGCCCCTGC TTACGATGTG ACCCACGCCT ACAACCCCAA GGGTGAATGG
ACCTACCAGC ACCTGATGAG TGTCAATCGT AAGTTCGACG GCATCTCGAA GGAAGACTTG
CTCGCAGTCG CCGATCGCTT CAGCGTGAGA CGACCGGAGC GCGCGCTGTC CGAGGTCCGT
GCCGCCATCG ATGGTTGGCC GCAGTTTGCA AAGCAGGCCG GGCTCTCCGC AGCGCTGCGG
GATCGCGTTG GTAAGGATTT GCTTCCGTTA TAG
 
Protein sequence
MKDVSVIEAR AWGRTVGAVT LDPTLAYYAF EYDPAWKRTG VELAPLHMPL KGSGPIYIFP 
TLPEATYFRL PALLADALPD DFGNALIDAW MAQRGVEKSA ITVLDRLAYM GKRGVGALEF
RPARGSHNES SAPIEMKELV EEARRLVQGT FAVDHEAQAA LANIIKVGTS AGGARAKAVI
AWNPATDEVR SGQFDAAPGF EHWLLKFDGI GKDKELGTGE GYGRIEFAYY LMATGAGIEM
AVSRLLEENG RAHFMTRRFD REVVNGKTRK HHIQTLCAMN HLDFRQRGTH DYAQLFMTAS
ALGLDDGALD QIFRRMAFNV MARNCDDHTK NFSFILREGQ PWRLAPAYDV THAYNPKGEW
TYQHLMSVNR KFDGISKEDL LAVADRFSVR RPERALSEVR AAIDGWPQFA KQAGLSAALR
DRVGKDLLPL