Gene Acid345_0863 details

Gene Information

Locus tag: Acid345_0863
Symbol:
ID: 4068957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1074697
End bp: 1076325
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 58%
IMG OID: 637982872
Product: phage tail sheath protein
Protein accession: YP_589942
Protein GI: 94967894
COG category: [R] General function prediction only
COG ID: [COG3497] Phage tail sheath protein FI
TIGRFAM ID:
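
The coordinate and length fields in this record are mutually consistent, and a short sketch can confirm that. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp values taken from the record.
length = gene_length_bp(1074697, 1076325)
print(length, protein_length_aa(length))  # 1629 542
```

Under those assumptions the result matches the Gene Length (1629 bp) and Protein Length (542 aa) fields.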


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00698739
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCCGA CTTTCACGTT TCCTGGCGTT TACATTGAAG AAATTCCCAG TGGAGTGCAC 
ACCATCACAG GCGTCGCTAC CTCCATCGCT GCCTTCGTGG GCTGGGCAGC GCAGGGCCCG
ACCGATGAAG CCACACTGGT CCAGAGTTGG GCAGACTTCG CGAACCAATT CGGCGGCCTC
GACGCCCGAA GCAATCTCGG CTACTCCGTC AATCAGTTTT TTAACAACGG CGGACAACAG
GCCTACATCG TGCGACTCGT CTCCGACACC ACGAACGGCA ACACGGCGGC TGCGACGGCG
TCGGTCAACA TCAAGACCAT AACCTTCGAC GCGAGCGTGT CACCCAGCAA AGTCACCGTT
ACGAAAGGTG CCGCCGGATT GACAATATCG GCCGCAAACC AAGGCGCATG GGCGAAGAAC
TACTCCATCC AGGTCCAGCC GCGAATTGAC GATTACAACC GCTTTACTCT CTCGGTTGTC
TACACCGATC CCGTCACGTC TGCGCAGACC ATTGTTGAGA GCTATTCGAA TCTCTCGACG
AACTCTGCCG ACACGCAGGG ACGCTACGTC GTCAACATCC TGAACGAACA GTCGAACTAT
GTAACGGCGA AGATGGCCCC AACCCCGGTC ACGCTGACCG TCACTCCCGG CGTTCCGACC
ACGCCGAAAG CCTCGAATCC CGGCTCTATC GCATTGAATG CCAGCGTTGA CGGAAACGAC
GGCACGCCGC TGGCACCGGG CGACACCGTC TTCGAGAAAA TGCTCAATTC GGGCGGAGCC
GGAACCGCAG GTGTTCGGTT GCTCGATACC GTTCCCATCT TCAACATCCT TTGCGTTCCC
GGCGAAACGG TGGTCCAGAA CATCACCGAA CTGCAAGCGT ACTGCGTGGA CAACCGCGCA
TTCCTAATCG TAGATTCCAA GTCGGACGAC AAGGTGAAAG ACCTGGCACT AAACGGTCCG
GCTGGCATTA CTGGCGTAAA CTCCATCAAC TCCGCGCTCT ACTTTCCGTG GGTCAACCAG
TTCGACTCGC AAACCAATAG CACTCGCGCC TTTCCACCCT GCGGCTTTGT TGCGGGCCTC
TATGCGGCGA CTGACACAGC CCGCGGGGTT TGGAAAGCGC CTGCCGGCAT CGACGCCAGC
CTCACTGGTG ACACCGGTCT CACGCTCAAT CTCACGAACG CGCAGAACGG AAGCTTAAAT
ATCCAGGCGA TCAATTGCCT CCGCAACTTT CCTGTGTACG GCGACGTCAT TTGGGGTGCG
CGAACGTTGC GCGGGAACAA CCAGGTCGGC TCCGAGTGGA AGTACGTTCC CATCCGGCGT
CTCGCTCTCT TCCTCGAAAG CTCGTTGTAC GACGGCACCC AGTGGGTCGT CTTCGAACCC
AATGACGAAA AGCTCTGGGG ACAGATCCGC ATGAACGTGG GTGCCTTCAT GCAGGGCCTC
TTCCTGCAAG GCGCATTCCA AGGCACCTCT CCGCAACAGG CCTACTTCGT CAAATGCGAC
GCCGACAACA ATCCGCAGTC GAGCATTGAT CAGGGCATCG TCAACATTCT CGTCGGATTC
GCTCCGCTCT ACCCCGCAGA ATTCGTCGTA ATACAGATCC AGCAGATGGC AGGACAGCTT
CAGGCGTAA
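
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be cleaned before any computation. The sketch below strips that grouping and computes a GC percentage that could be compared with the GC content field. The helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used here as example input.
first_line = "ATGCCGCCGA CTTTCACGTT TCCTGGCGTT TACATTGAAG AAATTCCCAG TGGAGTGCAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value to check against the rounded figure reported in the record.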
 
Protein sequence
MPPTFTFPGV YIEEIPSGVH TITGVATSIA AFVGWAAQGP TDEATLVQSW ADFANQFGGL 
DARSNLGYSV NQFFNNGGQQ AYIVRLVSDT TNGNTAAATA SVNIKTITFD ASVSPSKVTV
TKGAAGLTIS AANQGAWAKN YSIQVQPRID DYNRFTLSVV YTDPVTSAQT IVESYSNLST
NSADTQGRYV VNILNEQSNY VTAKMAPTPV TLTVTPGVPT TPKASNPGSI ALNASVDGND
GTPLAPGDTV FEKMLNSGGA GTAGVRLLDT VPIFNILCVP GETVVQNITE LQAYCVDNRA
FLIVDSKSDD KVKDLALNGP AGITGVNSIN SALYFPWVNQ FDSQTNSTRA FPPCGFVAGL
YAATDTARGV WKAPAGIDAS LTGDTGLTLN LTNAQNGSLN IQAINCLRNF PVYGDVIWGA
RTLRGNNQVG SEWKYVPIRR LALFLESSLY DGTQWVVFEP NDEKLWGQIR MNVGAFMQGL
FLQGAFQGTS PQQAYFVKCD ADNNPQSSID QGIVNILVGF APLYPAEFVV IQIQQMAGQL
QA
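
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own method. It builds the codon map from the standard assignments, which translation table 11 shares (table 11 differs only in its permitted start codons). The record's gene begins with ATG, so alternative start codons are not handled. All names are illustrative.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; NCBI translation table 11 uses the
# same codon-to-residue mapping and differs only in the permitted start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGCCGCCGA CTTTCACGTT T"))  # first seven codons: MPPTFTF
print(translate("CAGGCGTAA"))                # last three codons: QA
```

The two sample calls reproduce the first seven residues and the final two residues of the protein sequence shown in this record.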