Gene Acid345_0178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0178 
Symbol 
ID4073065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp189110 
End bp190375 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content61% 
IMG OID637982178 
Producthypothetical protein 
Protein accessionYP_589257 
Protein GI94967209 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000132954 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.337224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCC ATCGTTTGGT TCGCTTGCTT GTGCTCGCGT TGATGCTGAT GAGCATCCCG 
GCACTTTCTT TCGGAGGCGT GTTCGTTTCC GTTTCGGTTG GACCGCCGCC GATTCCCGTC
TATACACAAC CGTTATGCCC GGGCGCTGGC TATATGTGGA CGCCTGGCTA TTGGGCTTGG
GGTGACGAAG GCTATTACTG GGTTCCGGGT ACGTGGGTGA TGGCCCCGAC TCCCGGTTTC
CTGTGGACGC CTGGCTATTG GGGCTGGGGC GGCGGCGCCT ACCTATGGCA CGGCGGATAC
TGGGGCCCGC ACGTCGGCTT CTACGGCGGT ATTAACTACG GTTTCGGCTA CGGCGGCGTC
GGCTACGGCG GTGGCTACTG GCACGGCAAC AATTTCTACT ACAACCGCAG CGTGAACAAC
GTGAATGTCA CGAATGTGAC GAACGTCTAC AACAAGACCG TCATCGTGAA CAACAACAAC
CACGTGAGCT ACAACGGCGG GCACGGCGGC GTGACCCGGC AGCCCACTTC ACAAGAGCGC
CAGTGGCAGA ACGAGAAACA TGTTGATGCC ACGAGCGCGC AGCAACAGCA CTTCCAGGAA
GCGGGACGCA ACCCGCAGCT TCTGGCGAAG AACAACGGCG GCAAGCCGGC GATCGCTGCG
ACCGCACGCC CAGCGGACTT TAAGTCTGCG GTACCGGCAA AGGCAGTGGG TGGCCCAATC
AACAAGACGG CTTTGACGGC GACTCCGAAG AACATGCCTG CCCCGAAGAG CAATGCGGCT
GCGACGGCAA ATGGTAACGT CAACGCCAAT GCGAAGGGCA ACGCAAACAT TCCGAAGCCT
GGCAATGCGA GTGCGAGCAC GAACTCCAAG GTCGGCACGA ATGCGTCCGC GAACACCACG
GCGCACAATG TTCCGAAGCC ACCGGCGGCG AGCAGCAATA CACGAGACGT GAACACGGCG
CACACGAACA CGACAGCATC GCCGAGCACG AGCACGCATA ATGTTCCGAA GCCGCCAAGC
ACGAATGCGA CGTCGACGAA CCGTAGCAAC GCGTCGGTGA ATTCGCCGAA GACGTACTCC
TCGCAGCCGA ATACGGCTTC ACACCAGAGC CAGCCGGCGC CTCATTACAG CGCTCCAGCG
ACTCATAATC CACCGCCGCA AACGCACGCG GCTCCTCAGG TGCAACATAA TGCGGCGCCA
GCACAGCACA GTGCTCCTCC GCAGCACAGC GCACCGGCGA CTCACGACAA CAAACCAAAG
CGCTAA
 
Protein sequence
MSTHRLVRLL VLALMLMSIP ALSFGGVFVS VSVGPPPIPV YTQPLCPGAG YMWTPGYWAW 
GDEGYYWVPG TWVMAPTPGF LWTPGYWGWG GGAYLWHGGY WGPHVGFYGG INYGFGYGGV
GYGGGYWHGN NFYYNRSVNN VNVTNVTNVY NKTVIVNNNN HVSYNGGHGG VTRQPTSQER
QWQNEKHVDA TSAQQQHFQE AGRNPQLLAK NNGGKPAIAA TARPADFKSA VPAKAVGGPI
NKTALTATPK NMPAPKSNAA ATANGNVNAN AKGNANIPKP GNASASTNSK VGTNASANTT
AHNVPKPPAA SSNTRDVNTA HTNTTASPST STHNVPKPPS TNATSTNRSN ASVNSPKTYS
SQPNTASHQS QPAPHYSAPA THNPPPQTHA APQVQHNAAP AQHSAPPQHS APATHDNKPK
R