Gene Acid345_0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0741 
Symbol 
ID4069083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp912193 
End bp914130 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content60% 
IMG OID637982747 
Producthypothetical protein 
Protein accessionYP_589820 
Protein GI94967772 
COG category 
COG ID 
TIGRFAM ID[TIGR01905] doubled CXXCH domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0951377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCG CTTGGCTGCT TGCTCTCTTG GTCGGTGTCT TGTGCGTGAA TTCCCCAGCG 
CAGTGGACTA CCGACGTACT CGGCTCGCAT GATCTCTCGC CAGGCGGCAC TTCGCCGATC
AAGGGCCGGC TGAATTCCGG ATGCCAGTAT TGCCATGCGC CGCACTCAGG AATCACGATG
GGCTCGGCGC CCCTGTGGGC GCAGACACTC TCGAAGCAGA CCTACACCAC GTATACGAGT
ACGACGCTCA AGAACCTGAC CACCCAACCA CCTCTCGGGG GCGATAGCAA TCTTTGCCTG
AGCTGCCATG ACGGCACCGT CGCGCCGGGG CAGACGGTTC CTTATGGGCG CCTGCGGATG
ACAGGCAACA TGCTGCCGCA GGACAAGTTC GGCAGCAATC TCGGGGGCTC GCATCCATTT
AGTTTTCGAG CTTTGACTAC CGATTCGCCG GACCTGGTCA CGACCCTGAT CTCGAGCCAC
AAGACCGCGG ATCCGCAAAA CGCGGTGAAG TTGATTAACA ACAACGTGGA ATGCACGAGC
TGCCACAATC CGCACGTGCA GGCGATTGAC ACGGTGGCGC AGCAGTTCCT GGTACGCGAC
GGTTCGAACG GAGCACTGTG CCTGGCGTGC CATGAGCCGG GTGCACGCCA GGTAAGCAAC
CAGAACAACC CTCTGTCGCC GTGGACGACG AGTATCCACG CGAACACGAA CAATAAGCTT
TCGCAGGGCG CGGGACTCGG CAGCTATACG ACGGTCGGCG CAAATTCATG CATATCGTGC
CACGTTCCAC ATAGCGCACT GGGCGGAGCA GAATTGTTGC GGCAGCCGGC ATCGCCAGTG
CCGAACATGG ATTCGGCGAC ACAGAATTGC ATTACCTGCC ACAACGGGGG ATCGAACATC
TCACCGGCAA TTCCGAACGT GTATGCCGAA TTTGCAAAGA CGGGCCACCC GTATCCGGCC
GGCAACAACA CCCATAGCGC CGGGGAGGCA ACGGTCCTCG AAAACAATCG TCATGCGACT
TGCGTTGACT GTCACAACGC GCACGGATCG CAGCAGGTGA CGAGCTTTGA TGCTCCGCCG
AAGATACGCA TTTCGCAAAC CAGCACCAAG GGCTTAGGCG TGGACGGCAC CACGCAAATC
GATCCGGCGG TGAATCAGTA CGAGAACTGC TTGCGTTGCC ATGGGCCGAG TTCCGGGAAG
ACGACGTTGA CGATCTTCGG GTACGCGCCC GCGTGGGCGG CAGAGAATCC CGGCGATTCG
CTGAACGTGA TTTATGAATT CAACTCGTCC TCGACGTCGC GACATCCGGT GATGCTCGAT
CGCAGCAGCG GGTATCCCCA ACCAAGCTTG CGCGCGTTCA TGGTGCAACT CGACGGGAAG
ACCCAGGGGC GCTCCATGGG GCAGCGCATC TTCTGCACGG ATTGTCATAA CAGCGATGAC
AATCGTGAGG GTGGTGGAAC CGGGCCAAAT GGTCCGCATG GCTCGACGTT CAGCCACATC
CTTGAGCGCC GCTACGAATA CAGCCAGGTG GCTTCCGGCG CCGGTGCGGG TACGACGATC
ACAAACCTGA TTCCGAATCC GCCGCTCGAT CCTTCCGCGA ATGGACCGTA TTCGATGTGC
GCGAAGTGCC ACGACCTTAC GAACATCGTT TCGGATGCGA GTTTCTTGCC CGACAAAAAC
GGTAAGGGAG GCCACGCGAC CCACATCAAC GACGGGTTCT CCTGTTCCAT CTGCCATACT
TCGCATGGAA TGGGCGGAAC GGCGGCAGGC ATCTCCGGCG AGCGCATGGT GAACTTCGAC
CTGAAGGTCG TCGCGCCGAA CAATGGCACG CTGGCGTACT CGCACAGCGC AAATACCTGC
ACCCTGACCT GCCACGGCTA CGCGCACTAC TCTAACGGCT CTGTGACCCC GGCTCTCGCT
AAACCGGGGG TGAAGTAA
 
Protein sequence
MKRAWLLALL VGVLCVNSPA QWTTDVLGSH DLSPGGTSPI KGRLNSGCQY CHAPHSGITM 
GSAPLWAQTL SKQTYTTYTS TTLKNLTTQP PLGGDSNLCL SCHDGTVAPG QTVPYGRLRM
TGNMLPQDKF GSNLGGSHPF SFRALTTDSP DLVTTLISSH KTADPQNAVK LINNNVECTS
CHNPHVQAID TVAQQFLVRD GSNGALCLAC HEPGARQVSN QNNPLSPWTT SIHANTNNKL
SQGAGLGSYT TVGANSCISC HVPHSALGGA ELLRQPASPV PNMDSATQNC ITCHNGGSNI
SPAIPNVYAE FAKTGHPYPA GNNTHSAGEA TVLENNRHAT CVDCHNAHGS QQVTSFDAPP
KIRISQTSTK GLGVDGTTQI DPAVNQYENC LRCHGPSSGK TTLTIFGYAP AWAAENPGDS
LNVIYEFNSS STSRHPVMLD RSSGYPQPSL RAFMVQLDGK TQGRSMGQRI FCTDCHNSDD
NREGGGTGPN GPHGSTFSHI LERRYEYSQV ASGAGAGTTI TNLIPNPPLD PSANGPYSMC
AKCHDLTNIV SDASFLPDKN GKGGHATHIN DGFSCSICHT SHGMGGTAAG ISGERMVNFD
LKVVAPNNGT LAYSHSANTC TLTCHGYAHY SNGSVTPALA KPGVK