Gene Acid345_0416 details


Gene Information

Locus tag: Acid345_0416
Symbol:
ID: 4068735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 478185
End bp: 480206
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 60%
IMG OID: 637982420
Product: hypothetical protein
Protein accession: YP_589495
Protein GI: 94967447
COG category:
COG ID:
TIGRFAM ID:
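
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal consistency check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about the database conventions, not something the record states, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 478185
end_bp = 480206
gene_length_bp = 2022
protein_length_aa = 673

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True: the span is 2022 bp, which is 674 codons, one of which is the stop codon.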


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0810829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGCGC AATCCTGTTG CATCGCTCAT GCTCTCGACA TACAATCACC GCCTTCCCAC 
CGGCTCCAAG TCCCGACTTT AGTTTCAATT TCCCGTTGCG AGGAGACTGC GATGCGCAAG
TTGGCAGGAA TCACTGTCGT GCTGACCGTT GTATTTGCCG CGTGGAGCGC GCTGGCCGCC
ACACCAGCGA CGATGGTCAC CCCGGTTGCC GGCTCGAAGT TCGCAGGGGC GAACGTGACC
TTCACCTGGA ATGCGGGCGC AGGAGTCTCG CAGTACTCGC TTTACATAGG GACAACGCCG
GGGGCGCACG ATCTGGCTTT CGTGAGCACC GGAGTGTCGA CGACGACGAC CGTGAACGGT
CTGCCCACGG ACGGGCGCAA CATTTATGTC ACGTTGTATT CGCTGATTGC CGGGGTATGG
CAGGGGAATC GCTACAGCTA CTTCGCCTCG GGTGCCGGTG TCGCCGCAAC AATGACCGCT
CCGGCTGCCG GTTCGAAGTT GGCGGGCGCG AGCGTGACGT TTTCATGGAA TACGGGGGCG
GGTATTTCGC AATATAGCCT CTACGTCGGG AATACGAAGG GAGCACACGA CATCGCGTTC
TCAAGCGGCA ATGTGACGTC GAAACTGGTC AACGGCCTTC CGACGGACGG ACGGATGGTT
TACGTCACGC TGTACTCCCT GAACGGCTCG ACCTGGCTGA GGAACTACTA CACATATGTC
GCGTCGGGAG TCGGCGTGGG GGCGGTAATG TCGTCGCCAG CGCCTGGATC TACCTTCGCA
AACTCTGCAG CGAACTTTTC GTGGACGAAC GGGACCGGCG TCTCTGAGTA TTCGCTTTAC
GTGGGGAGTA CACCGGGGGC GCATGACATT GCCTATGTGA ATGCCGGAAG CATCCCGTTA
GCCACGGTTA CGAACCTGCC GACCAATGGG TCAACCGTGT ATATCAACCT CTATTCGCTG
AATGGGGCGA CTTGGCTGAG GAACAGTTAT ACATACACGG CTGCGGCCGC GCCCTCAAAG
CGGGTAGCCT GGATTCCCGA CTTCTACGGC GAGACATTGC AGGTGCGGAT CGGCACTGGC
GCCGGTGCGA TCGCCACCAG CGTCAACCTG CCCACATGCA ATCCGAACAG CGTCGCGGTA
AACAGCGATA AGGCATACGT CGTGTGCTCG GCCTTCGAGG CGAATCCTGA CAAGATCCTG
GTGTACGACG CGACCGTGAT TCGTGCCTCG GCGGGGGGCG TATTGGCGAT TAGTCCGACG
AAGACGATCA CGAGCGCGCA GTTCAACTCG CTGATCGGAA TCGCCTTTGA CGCTGGGAAC
AACCTCTGGG TGGCGAGTTA CGGAAACCAT CAGATCAACG AGATCACCGC TGCGGAACTG
GCGAAAGCCT CGCCTACCGC TACGGCGGAG TTGGTTCACT CTCCTGACAA TCCGGTAACG
CTCACCTTCG ACAGTTCCGG AGGCATGTGG GTGAGCGGGC AGTACTCGGG CGGAATCGTG
CTGCACTTCC CGAGCAGCCA GATCCACAGC GGCTCCGGCG CAACTCCTGA CTATTGCCTG
GCGACGACGG ACCTCGGGGC GGGATGCCAG TTCGTGGACG GCATCTTCTT AAACCCGGAG
GGGCTCGCTC TCTATAACGG GGACGTCTGG GTGGCGAACA ACGCTACGGG AGCAGCCGGT
GAGGTCCCGG GACGACAACT CGTGGACTTG AAATTCAATG CCGGGAACGT GACGGTGAAC
GGTACGTTCG GTGATCCAAC TGCCGCTGCG AAGAGCCCGT TCGTCTGTCC GGGCGGACTG
TTCGCGGGAG CAATCCATCT TTGGATCAAC GACGAGAGCT ATGCCGAGGC GGATCCGCAG
TGTGGCGCCA TGGGCGACGT ATCGGCTGCA ACTGGCGGTG TGTTTGCGTT CACGCCGGCA
CAACTGGCCG CCCGGAGCAC GTCCACGAGC CAAGTGCTGC CGTATTCCGG GGTTACCGGA
AGACCAGGAT TCGGGGGCAT TTTTGTCGAG AAAGACCAGT AG
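
The GC content field can be recomputed from a sequence laid out like the one above, in space-separated blocks of 10 bases. The following is a minimal sketch; `gc_content` is a hypothetical helper name, and it assumes the input contains only the characters A, C, G, T plus whitespace. Only the first line of the gene sequence is used here for brevity, so the value it prints is for that fragment and is not expected to equal the 60% reported for the whole gene.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record (60 bases).
first_line = (
    "ATGCGGGCGC AATCCTGTTG CATCGCTCAT GCTCTCGACA TACAATCACC GCCTTCCCAC"
)

print(f"GC fraction of the first 60 bases: {gc_content(first_line):.2f}")
```

To check the reported figure, pass the full gene sequence to the same function.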
 
Protein sequence
MRAQSCCIAH ALDIQSPPSH RLQVPTLVSI SRCEETAMRK LAGITVVLTV VFAAWSALAA 
TPATMVTPVA GSKFAGANVT FTWNAGAGVS QYSLYIGTTP GAHDLAFVST GVSTTTTVNG
LPTDGRNIYV TLYSLIAGVW QGNRYSYFAS GAGVAATMTA PAAGSKLAGA SVTFSWNTGA
GISQYSLYVG NTKGAHDIAF SSGNVTSKLV NGLPTDGRMV YVTLYSLNGS TWLRNYYTYV
ASGVGVGAVM SSPAPGSTFA NSAANFSWTN GTGVSEYSLY VGSTPGAHDI AYVNAGSIPL
ATVTNLPTNG STVYINLYSL NGATWLRNSY TYTAAAAPSK RVAWIPDFYG ETLQVRIGTG
AGAIATSVNL PTCNPNSVAV NSDKAYVVCS AFEANPDKIL VYDATVIRAS AGGVLAISPT
KTITSAQFNS LIGIAFDAGN NLWVASYGNH QINEITAAEL AKASPTATAE LVHSPDNPVT
LTFDSSGGMW VSGQYSGGIV LHFPSSQIHS GSGATPDYCL ATTDLGAGCQ FVDGIFLNPE
GLALYNGDVW VANNATGAAG EVPGRQLVDL KFNAGNVTVN GTFGDPTAAA KSPFVCPGGL
FAGAIHLWIN DESYAEADPQ CGAMGDVSAA TGGVFAFTPA QLAARSTSTS QVLPYSGVTG
RPGFGGIFVE KDQ
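
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so a standard codon table is enough here. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from this record.
print(translate("ATGCGGGCGC AATCCTGTTG CATCGCTCAT"))  # prints MRAQSCCIAH
```

The output matches the first block of the protein sequence listed above. Passing the full gene sequence should reproduce the remaining blocks in the same way.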