Gene Acid345_1577 details

Gene Information

Locus tag: Acid345_1577
Symbol:
ID: 4069015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1926441
End bp: 1928081
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 60%
IMG OID: 637983586
Product: hypothetical protein
Protein accession: YP_590653
Protein GI: 94968605
COG category:
COG ID:
TIGRFAM ID:
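
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1926441
end_bp = 1928081
gene_length_bp = 1641
protein_length_aa = 546

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(span % 3 == 0)                                 # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks should print True for this record.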


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.188499
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.121402
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAAG GGAACAATCA AGGCGGCCGT CCACTTCTGG CGTGCGAGAT CACGCGCACC 
CAGGTGTTTG CGACGCGCTG GGCCGAAAAA ACTACTGGCG TCGAGGTACT GCAAGTACGC
ACGATTCCGG GCGGTGTCTC TCCGAATTTG ACGAGTCAGA ACGTTTCGGA CGCAGGCGCG
CTTAAGAGCG TGGTGGCGGA TGCGTTGCAG GCCAGCGGCG CCCGCACCAA GGACGTGACT
CTCATCGTTC CTGACGCCGC CGTTCGCGTT GCGCTTCTCG ATTTCGATAC GCTTCCTGAA
AAGAAGCAGG AAGCCGATGC GGTGGTGCGC TTTCGACTGA AGAAGTCATT GCCGTTCGAA
GTAGACCGCG CAGCCATTTC CTACCACGCT CAGCCGAATG GCACGACATT GCGCGTGCTC
GTGTGCGTGA TGTTGAACTC GGTGCTGCAC GAGTACGAGT CTGCAGTGCG CGACGCAGGA
TTTTTGCCGG GTGTCGTTCT GCCATCGACC CTGGCAGCGC TGGGCAATGT AAGTGTTGAC
GCTCCCACGA TGGTCGTGAA AATCGCAGAC GGGACCACGA CGATCGCGAT TCTCGATCAG
GGACGCCTGC AGCTTTATCG AACTCTCGAC CACGGCTCTC CCGACGTGGA GCCAGCCTCG
TTGGCGCATG ATATCTACCC GTCGGTCGTG TTCTTTCAAG ACACCTACGG CGTACCGATC
GAGAAGATCT ACGTTTCCGG AGCGAATAAC TTCGCTGCTG TGGCGCCGCA TCTTGCGCAG
GAGACAGCGG CGGAAATAGA AGAACTCGAC AATCCCGTGT TCGCAGGACT GAATCCCGGC
ACATTGCCGA AGAGCATGCT TGCTGGCGTG CTGGGTTCAG GGCTGACGAA GACCAGGATC
AACCTGGCGA GCGAGCCCTA CGAGGACGCG AAGCTGTACC TGGCGCGCTT CGGGACAATT
GCTGCAGCGC TCTTGCTGGT TGCGGTGGGG CTGTTGTGGT TCACGATTCA CAGTGTTCGG
CGCTCGAGCG ACATCAATCG GAAGTTGTCT GCCGTGCGAG GGCAGATCGA CACACTCGAC
CGGGAAAGGG TCCTGGCCGA GAAGATGCTC GCCCTGCCGC AGAACCATGG CACTGTGAAT
AAATCGGAGT TCTTGAACAG CGTCTTTGCG CGTAAGGCGT TTTCGTGGAC GACGGTTTTC
TCCGACATGG AGAAGATCAT GCCGCCGGGG CTGCACGTGG TTTCGATCGC GCCGGAACTC
GACGCGCAGA ACCAATTGAA AGTACAGATT GTTGTGGCAG GTGAGAACCG AGACCGGGCG
ATCACACTGG TGCGAAATAT GGAGCAGACG CCGCGGTTCC GCGATGTGAT CCTGCGAAGC
GACATACAAA ATACGGTGCT AGGTGGCACT AGCGCTGAAG ATCGCGATCC GATCCGTTTC
GACATCGTTG CGCAGTATCT GCCTTCGGCG CCGAATCCGG CGCCACCAGC GAGCGGAGTC
GCGTCGAAGG CAGAAGATGC GCCGTCCGCA GCGAGCGCAG AGCCTAAGGC TGCCGAGGCA
TCGGCAGGTC CGGCGCCAGC ACAACCTGCT GCCCAGCCGA AGCCGCAGAC CGTCGCCAGA
AAGGCGGGAG CACAACGATG A
 
Protein sequence
MLKGNNQGGR PLLACEITRT QVFATRWAEK TTGVEVLQVR TIPGGVSPNL TSQNVSDAGA 
LKSVVADALQ ASGARTKDVT LIVPDAAVRV ALLDFDTLPE KKQEADAVVR FRLKKSLPFE
VDRAAISYHA QPNGTTLRVL VCVMLNSVLH EYESAVRDAG FLPGVVLPST LAALGNVSVD
APTMVVKIAD GTTTIAILDQ GRLQLYRTLD HGSPDVEPAS LAHDIYPSVV FFQDTYGVPI
EKIYVSGANN FAAVAPHLAQ ETAAEIEELD NPVFAGLNPG TLPKSMLAGV LGSGLTKTRI
NLASEPYEDA KLYLARFGTI AAALLLVAVG LLWFTIHSVR RSSDINRKLS AVRGQIDTLD
RERVLAEKML ALPQNHGTVN KSEFLNSVFA RKAFSWTTVF SDMEKIMPPG LHVVSIAPEL
DAQNQLKVQI VVAGENRDRA ITLVRNMEQT PRFRDVILRS DIQNTVLGGT SAEDRDPIRF
DIVAQYLPSA PNPAPPASGV ASKAEDAPSA ASAEPKAAEA SAGPAPAQPA AQPKPQTVAR
KAGAQR
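
The sequences above are displayed in blocks of 10 characters separated by spaces, so they need to be cleaned before use in most tools. The following is a minimal sketch of one way to do that and to compute a GC fraction. It is not part of the original record, and the helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTTGAAAG GGAACAATCA AGGCGGCCGT CCACTTCTGG CGTGCGAGAT CACGCGCACC"

cleaned = clean_sequence(first_line)
print(len(cleaned))                    # number of bases on this line
print(cleaned[:3])                     # first codon
print(round(gc_fraction(cleaned), 2))  # GC fraction of this line only
```

Passing the full gene sequence to `clean_sequence` and comparing its length and GC fraction with the Gene Length and GC content fields is one way to check a copied record for truncation.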