Gene Acid345_4220 details

Gene Information

Locus tag: Acid345_4220
Symbol:
ID: 4073146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4998539
End bp: 4999627
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 60%
IMG OID: 637986251
Product: hypothetical protein
Protein accession: YP_593294
Protein GI: 94971246
COG category:
COG ID:
TIGRFAM ID:
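
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in this record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4998539
end_bp = 4999627

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1089
print(protein_length)  # 362
```

Under these assumptions the computed values agree with the Gene Length (1089 bp) and Protein Length (362 aa) fields.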


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00877371
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0608426
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGC GCGATACTAA TCGCATTGTC CGGCCCTTTG AATGGGGGCA GGAATGGACG 
CGTGATTTTC CCGGCGCGGA CCGCCTCCAG TGGGGCGAGA CCGCGGCTGA ACACTTCGAT
TACTTCACGG AGTTGAATCG CCACATCGTC GAACACAGCG ACGAGTTTTT TTCTTACAAG
ACGCCGAGCG ACTATCGCCT CGAGAAGCGT CGGGTGCAGG TGTTCTTCAC CGGATCGGGA
GAGCCGCCCA AGGATCCGGA TGAGACTGGC ACCTATCTGC GCTTCACTTC GCCGCATCCG
TCGCCGTATC TGGAGAACAA CGTTTTCAAT GCGCGCTGGT TCCCGGCGAG AGGGAAGCGG
GCGATTATCG TGCTGCCGCA GTGGAACGCC GATGGCATCA GCCACAACGG CTTCGCACGC
ATCTTCAACC CGATGGGCAT TGCGGTTCTG CGTATGAGCA AGCCGTATCA CGATATTCGG
CGGCCGGCGG AGTTGCACCG CGCCGACTAT GCGGTGTCGT CGAACGTCGG GCGCACGATT
CATGCGGCGC GGCAGGGTAT CACCGATATT CGCGCCGCTC TCGATTGGCT CAACTCTGAA
GGCTATACGC AGCTGGGAAT CCTAGGCACG AGTCTTGGTT CCTGCTATGC GTTCATCGCG
AGCGCGCATG ACGAGCGGCT GCGGGTGAAC GTCTTTAATC ACGCTTCGAC ATACTTTGGC
GATGTGGTTT GGACGGGGCA GTCGACCCGT CACGTGCGCG CGGGGATTGA AGAGGTCGGC
CTCGATATGG ATGCGTTGCG GAAGATATGG CTGGCCGTCA GCCCGATGGC GTTCTTCGAT
AAGTTCGAGC GCTGGCAAAA GAAGTCGCTG ATGATCTACG GCAAGTACGA CCTCACGTTC
CTGCCGGAGT TCTCGCAGCA GATCGCCGCC GAGTTCAAGC GCCGTGGATT AGACACGCTA
GTGAAAGCGC TGCCGTGCGG ACACTACTCG CTGGGCGAGA CGCCCTACAA ATACATGGAC
GCGTGGCATA TTTCGCGGTT CCTGCGGCGA GCGTTTGGGG CGCATATGCA GACACAGCAC
GCGGTGTAG
 
Protein sequence
MTTRDTNRIV RPFEWGQEWT RDFPGADRLQ WGETAAEHFD YFTELNRHIV EHSDEFFSYK 
TPSDYRLEKR RVQVFFTGSG EPPKDPDETG TYLRFTSPHP SPYLENNVFN ARWFPARGKR
AIIVLPQWNA DGISHNGFAR IFNPMGIAVL RMSKPYHDIR RPAELHRADY AVSSNVGRTI
HAARQGITDI RAALDWLNSE GYTQLGILGT SLGSCYAFIA SAHDERLRVN VFNHASTYFG
DVVWTGQSTR HVRAGIEEVG LDMDALRKIW LAVSPMAFFD KFERWQKKSL MIYGKYDLTF
LPEFSQQIAA EFKRRGLDTL VKALPCGHYS LGETPYKYMD AWHISRFLRR AFGAHMQTQH
AV
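
The gene and protein sequences above can be cross-checked with a minimal sketch in plain Python. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted), which is adequate here because the gene begins with ATG. The helper names `clean`, `translate` and `gc_content` are hypothetical and not part of any database tool.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First display line of the gene sequence in this record.
first_line = "ATGACCACGC GCGATACTAA TCGCATTGTC CGGCCCTTTG AATGGGGGCA GGAATGGACG"
print(translate(first_line))  # MTTRDTNRIVRPFEWGQEWT
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.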