Gene Acid345_3272 details


Gene Information

Locus tag: Acid345_3272
Symbol:
ID: 4072684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3875888
End bp: 3877537
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 62%
IMG OID: 637985293
Product: hypothetical protein
Protein accession: YP_592347
Protein GI: 94970299
COG category:
COG ID:
TIGRFAM ID:
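
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3875888
end_bp = 3877537

# Assumption: coordinates are inclusive, so both end positions count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1650 bp) and Protein Length (549 aa).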


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000123414
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAATGG ATAAAGAAGT GGCTCCCAGC CGTAAGTTAA TCGCAGGGGT GCTGCTCATG 
ATGCTCGTTG TGTCGCTTTC GGCGCTGGCG CAACAGCAAA CAGCAGCACC GTCTGCCAGC
ACCACCACCT CAGCCCCCCA CAACGAATTA TTCGCTGGCT ATTCCTGGTA CGACCCGCGT
GGCTATTACA CGGGAGATGT CGCTGGCGTG GCCTTCAAGG CTCCCTCGAT CACACCAGGT
TTCGCCGCAG CCTATACGCG TAACTACGGC AACGTATTCG GCTTCACCGT CGATTACAGC
GGTCACTTTG GCGATGCCAA CCACGTCAAC ACGTTCCTCG TGGGACCGCA GTTGAAGTGG
CGTGCTGAGC ACTTCCAGCC CTTCGGCCAG ATACTGGCTG GCTTGGCTGT CATCAGCGCT
CCCAATACCA TTCGCGGAAC GCAGTACCAG GGCGCAGTTG GCGCAGGCGG CGGTTTCGAC
CTTCTGCTGA CTGAGAAATT CGGCATTCGC CTCCTGCAAG CCGATTACAT CTACACCAAT
TGGGAACAGG GCGCGGCCAC GGGTACTCCG AGCCGTTGGA ACTCCGTACG CCTCCAGGGC
GGCCTCCTTT ACCAGTGGGG CTTCAACCCG ACCGTTCCGG TTTCGGCCGC TTGCAGCGCG
CAGCCTTCGT CGATCATGGC GGGCGAGCCT GTGAAGGTCA CCGCTACGGG ATCTAACTTC
AACCCCAAGA AGACCGTCAG CTACGCGTGG ACGAGCACCG GTGGCAAAGT CAGCGGCACC
GACGCAACCA CCACGGTTGA CACCAACGGT CTTGCCCCGG GTACCTACAC CGTGAAAGCG
ACGCTCAGCG ATGGCGCGAA GAAGAACCCG AACGTCGCAG AGTGCAACGC GACCTTCACC
GTCAACGAAC CGCCGAAGCA TCCGCCGACA ATTTCTTGCA CGGCGAATCC GTCGACCGTT
CGCGCGGGTG ACGCTTGCAA CATCGCTTGC AACGGCAACA GCCCGGATGG ACGTCCGTTG
ACCTACACCC ACAACGCGAC CGGTGGCCGC CTGACGCCTG ACGGCGCCAA TGCGACCCTC
GATACGACGG GTGCAGCAGC GGGTCCGATC ACCGTTAACA GCACCGTGAG CGACGATCGC
GGCCTTACCG CCTCGACTTC GTCTTCGTGC AGCGTGGAAG CTCCGCCGGC CGCTCCGACC
GCGAGCAAGC TGAACGAAAT CACCTTCCCG AACGAGAAGA AGCCAGCTCG TGTGGACAAC
ACCGCCAAGG CGATCCTCGA TGACGTTGCG CTGCGCCTGC AGCGTGAGCC GTCGTCCAAG
GCCGTCGTGG TTGGTTACGC CACCGCGGAA GAAACCAAGA AGAAGGCCAA CGCAAACCTC
GCCGCACAGC GCGCCGTTAA CACCAAGGCC TGCCTCGACG GTGAAGAGGT ATCTTGCGAG
AACCAGAGCA AACAGATCGA CCCGAGCCGC ATTGAAGTTC GCACCGGCAC GGGCGACCAG
AACAAGGCCG AGATCTGGAT CGTACCGTCC GGCGCCAGCT TCACCGGCGA AGGCACGACC
CCGGTCGACG AAAGCAAGTT CAAGGCGCAG GCTCGTACCG CTGCCGGCGC TAAGGCTGCC
AAGAAGGCCC CGAAGAAGGC TGCGAAGTAG
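
The gene sequence is printed in blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes a GC fraction, and translates codons. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which assigns the same amino acids as translation table 11 (the two differ only in permitted start codons). Only the first line of the sequence is used here to keep the example short.

```python
from itertools import product

# Standard codon table built in TCAG order; amino-acid assignments are
# the same as in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_line = "ATGTCAATGG ATAAAGAAGT GGCTCCCAGC CGTAAGTTAA TCGCAGGGGT GCTGCTCATG"
dna = join_blocks(first_line)

print(len(dna), round(gc_fraction(dna), 2))
print(translate(dna))
```

Passing the full 1650-base sequence through the same functions gives the whole-gene GC fraction and a translation that can be compared with the protein sequence in the record.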
 
Protein sequence
MSMDKEVAPS RKLIAGVLLM MLVVSLSALA QQQTAAPSAS TTTSAPHNEL FAGYSWYDPR 
GYYTGDVAGV AFKAPSITPG FAAAYTRNYG NVFGFTVDYS GHFGDANHVN TFLVGPQLKW
RAEHFQPFGQ ILAGLAVISA PNTIRGTQYQ GAVGAGGGFD LLLTEKFGIR LLQADYIYTN
WEQGAATGTP SRWNSVRLQG GLLYQWGFNP TVPVSAACSA QPSSIMAGEP VKVTATGSNF
NPKKTVSYAW TSTGGKVSGT DATTTVDTNG LAPGTYTVKA TLSDGAKKNP NVAECNATFT
VNEPPKHPPT ISCTANPSTV RAGDACNIAC NGNSPDGRPL TYTHNATGGR LTPDGANATL
DTTGAAAGPI TVNSTVSDDR GLTASTSSSC SVEAPPAAPT ASKLNEITFP NEKKPARVDN
TAKAILDDVA LRLQREPSSK AVVVGYATAE ETKKKANANL AAQRAVNTKA CLDGEEVSCE
NQSKQIDPSR IEVRTGTGDQ NKAEIWIVPS GASFTGEGTT PVDESKFKAQ ARTAAGAKAA
KKAPKKAAK