Gene Acid345_2836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2836 
Symbol 
ID4071839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3373576 
End bp3374847 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content60% 
IMG OID637984854 
Producthypothetical protein 
Protein accessionYP_591911 
Protein GI94969863 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03440] conserved hypothetical protein TIGR03440 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC GAATGTCGGC GATCCAGACA GGGCCAGACA GCAAGGCCCT GGCAAAACAG 
TTCTCGTCCG TGCGCTCTTT CAGCGAGCGA CTGGTCGCCC ATCTCGCGCC AGAGGACCTG
ATGGTCCAGT CCATGCCGGA CGCAAGCCCG GCAAAGTGGC ACCTCGCCCA CACCACGTGG
TTTTTCGAGA CTTTCCTGCT TGCTGAGTTC CAGCCCAGCT ACAAAGCCTA CGACCCGGCT
TTTCGGGCGG TCTTTAATTC CTATTACAAA GGCGTCGGAA AGCATCCGGT GCGCGGGATG
CGCGGCACAT TTTCGCGTCC CACGCTCGAT CGCGTGCTCG CGTATCGGGT CCACGTGAAC
GCCGCAATGG AGCGGCTGAT CGATTCGGAT CTGCCGGAGA GCGCGAGAAC TCTAATCGTC
CTCGGCCTCA ATCACGAACA GCAGCACCAG GAACTGATCG TCACCGACAT CAAGCACGCC
TTCTGGACCC AGCCCTTGCA GCCCGCGTTC GTTGAATCGT CAGACGAAGA AAGTCGTTCT
GCTCCTCCGC TCACCTGGTC GGCATTCGAC GGTGGGGAAG TCGAGATCGG GCATACCGGC
TCCGGCTTCT CCTTCGACAA CGAAGAGCCT CGGCATCGCG TTCTTTTGCA GCCATACAAG
TTGGCGAATC GGCTGGTCAC CAATCGCGAG TACTTAGCAT TTATGCAGGA TGGTGGTTAC
CACCGGCCTG AGTTGTGGCT CTCCGATGGT TGGGACACCG TCAATGCGCA GGGATGGGAA
GCGCCGTTTT ACTGGGATCG CGACGGACAG CAGTGGCGTG TCTTCACCGC CGCTGGAACG
AAGCCGGTAA ATCTCGATGA GCCGGTTTGC CACGTCAGCT TCTACGAGGC CGATGCTTAC
GCACACTGGG CCAATGCGCG GCTGCCGCTC GAAGCCGAGT GGGAACACGC TGCGGCATCT
CAGCCGATAC GCGGCAACTT CGCTGAGTCC GGACGATTTC ATCCAACCGT TGCGCCATCC
GCGGATGCGC CACAGTTCTA CGGCGACGTT TGGGAGTGGA CCGCCAGCCC ATATGTTGGC
TATCCGGGAT TTCAGCCGGC AGCGGGCCTG GTCGGCGAGT ACAACGGCAA GTTCATGTGC
AATCAGTTCG TGTTGCGTGG AGGCTCGTGC GCCACGCCGC AGTCTCACAT TCGGGCCAGC
TATCGCAATT TCTTCCCGCC ACAGGCTCGG TGGCAATTCA TGGGAATCAG GTTAGCGGCC
AATGCACGTT AG
 
Protein sequence
MSERMSAIQT GPDSKALAKQ FSSVRSFSER LVAHLAPEDL MVQSMPDASP AKWHLAHTTW 
FFETFLLAEF QPSYKAYDPA FRAVFNSYYK GVGKHPVRGM RGTFSRPTLD RVLAYRVHVN
AAMERLIDSD LPESARTLIV LGLNHEQQHQ ELIVTDIKHA FWTQPLQPAF VESSDEESRS
APPLTWSAFD GGEVEIGHTG SGFSFDNEEP RHRVLLQPYK LANRLVTNRE YLAFMQDGGY
HRPELWLSDG WDTVNAQGWE APFYWDRDGQ QWRVFTAAGT KPVNLDEPVC HVSFYEADAY
AHWANARLPL EAEWEHAAAS QPIRGNFAES GRFHPTVAPS ADAPQFYGDV WEWTASPYVG
YPGFQPAAGL VGEYNGKFMC NQFVLRGGSC ATPQSHIRAS YRNFFPPQAR WQFMGIRLAA
NAR