Gene Acid345_4711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4711 
Symbol 
ID4070650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5572049 
End bp5575039 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content57% 
IMG OID637986756 
Producthypothetical protein 
Protein accessionYP_593785 
Protein GI94971737 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAAGAC TCTTTTCCTT CGCGGGCATA CTTTCTTTTC TACTTATCCT GGCAAATTGC 
GGTGGTGGAA GCGGCAACAC GCAACCTCCG GCGGGTGGCG GCGGTGGTGG CTCAACTCCG
AGCTTCACAG TCTCCCTTTC CATTTCCACA GTGAACCTGA CACCGGGCGG AGCGACACAG
GATGTCACTG TCTCGGTGAC GGGAAAGGGC GGCTACTCCG GATCGGTATC GGTTAGCGCT
ACCGGTCTTT CTTCCGGGGT GACTGTTTCG CCGACATCGG TGTCTGTTCA GACGGGAAGT
AGTGGGAAGC TGACCTTCTC GGCCTCAGAT TCAGCGACAG TCGGAGCGCA ATCGGCAAAC
ATTGAGGCAG TCGCAGGATC CGTCAAAGTT TCAAGTCCCG TGCAGATCAA CGTGGCGAAG
GCTGCACGTC CTGATCGATT CCACTCTGTA GGTGGAACTT TATGGCGCGG ATTTTACGAC
GATACTCGAG GGTTACTCTT CGCAGCGAAT CCAGGGTTGA GTGAAGTCGA TGTGATATCC
GGTTCTGATT TCACGATCAA AGCGAGAGTC TCGGTACCGC AGGCGTGGAG CGTTGACCAA
ATGGCGGACG GCAAGACGCT CGTTATCGGC ACGGTTGCTC AAGAGTTCTT CACGCTGGAC
GAAGATACTC TCAAGGCGAC GATTCATCTG CTGCCGACAT TGCCGCGTAT CTCCTATAGC
TTGAATTGTC CATCCGTTGT GGCGATGGCC AATGGCATCG TCTTTCTGCT GACCCAGGAA
ATGGGAATCG CGGGAGGTGG CGCAGATGGC GCCGCTCACT TAATAAAGTG GGATTCAAAG
AAGAACACGT ATACGGAACT TGGCCCTCTG AACGGAGCTT CGAGTTGGTC CACGAAGAGC
ATGGTTCGAA GTGCGGACCG CAAATGGGCG GCTTTCGCGG TGGACAAGTT CTATTTGTAT
AGTTCGGACG ATGACAGTAT TACGTCAGTC GTGGATCTGG CGACGGTCAA CCCGCCCGCC
GATTCGTTTG GTGTTCGGGG TTACGCGCTG AACGCTGACG GAAGCAAGAT TGCTGTCGCT
TCCGCATCGC AAGTCACCTT CCTCGATCAC TCATTCAATG TACTCGCGAC AGTACCGTAT
TCATCCGCTT TTCAGGATTC AGGCACCACC GTGCGATTCA CTGCGGACGG GAACCGCCTC
ATCATGCAGA ATATTTTCCC GGTCTCGCTT GAGATGGTCG ATGCGAACTC CTATACGGCG
CTCGGATACC AACCTGCTTT CGGAGACAGA GCAGATGTTT ACTCGACGAT AATCGCGATC
GACGGCGTCG GACGGGCCTT TGTCGGATTC GATGGCGGAT ACGAAGTAAT CGATACAGCG
CAGACACCGG TTCCAAATCC GACCGAAGCG GGCGCTACCT TGGACGGGCC GGAATGCCCC
CTCCCGAACC CGCCCAATGC AGGCCTGAAC GCGAGCCTGA CATATTCCAC GTTTAACAGC
AGTCTTTTTG CCGGCTACTC CTTCTACTTC GCTGGAGCGG CCGGAACCGT GTCCGCAGAC
GGCACCCAAG TGACTGCGCC GGCGTCTTCC CAAGCAGGCC CGGTTGATGT GGAGTGCGTC
GATTCGGCAG GGAGTTCCAG GACGCTGCCA TTTGCATTCT CCTACGGGGC AAAGGCTGCT
GCGGTGAGCG CAAACCTCTT ATCTCCCGTG GGAGAACAAT CTTTGTATGC GTTCGGTTTT
GGTTTTTTCT CCGATGCATC CTCCGTTCCG GCGGTGTCGG TAGGCGGGCT AGCTGCCGCG
AACGTTGAGC AGGTTTCGCT TTCGAAGGGC AGCTTACAAG GAGTTCGTCT GCAACCACCA
ACGCTCTCCG CTTCCACAAC CGCGGATGTC ACTGTGACAA GCGCGTACGG ATCGAGCACG
GTCAAGGGCG CGGTTTCATA CATCCCATCC GCGCACGTGG TAGCGACGAG TGGAGTGTTG
CAACTGCTCT TCGATTCGCA TCGCAACTTG CTCTATGCCC TAAAGGCCTC CGAGATCGAT
GTCCTGGACC CGATCTCGCT GACGTGGAAC ACGCCGTTTG CGCTGCCGGC ATTCGCTGGT
TCTGCGAACT ACGGATATAT AGCGCTCAGT CCGGATGGAA GCCGATTAGT GGCTGTCGCC
AGCGCAGGCT ACGCCGCTGT CGTCAACCCT GACGACCCAT CGAAAACCTT CTCGGTCTCG
ACGCCGAACC CTGGGTTCTC GTGGGGCAGG GTCGTGATAA CCAAGGAGAA CAAGGCCGTA
TTCGGCGGAA GGCCGCCTGT TGAAATCGAT CTCGCAACAT CAACCGGGAA AGTTATTCCA
ACTTATCTAG GGTGGCTGAT CGCATCACCG CCGGACGGAA GTGTGATTTA TGGCATTGAC
ACCGGTGTCA CCACAGGGCA GGCGTACCGA ACTGACATCT CCACCTACAA AACGACGAGC
ACGCCGCAGT TTGGCGCTCA ATTCTGGTCT GACTTGGCCG TCTCAGCGGA CGGATCACAT
TTCGCTGGGA TCCTCGCGGA ATCAAATGGA GGCGACGTTA TAGGGTTCTT TGAGTCGGGA
CTTCATCTCG TCAATTTCAA TGAAGGTCCG CTGCTCAGCC CTGCAGACGA TTCTCTCGTA
TTGGGCTCAG TATTTGGCCC TAAAGGAAAC GTGCTGGTGG TCGCATTGGG AGACTCTATC
GAATTCTGGG ATACGCAGAC CGGTACTCTG CGCGCCCGGC TTATGACGCC TGAAGAGTTG
CAGACGGAAT CGGGTTCAGC CAGTTTCGCC GCACCGCAGG TCGCCCTAGA CTCGACGGGA
CAGACTATCT TCGCAGTGTC CGCGAGTGGT ATCAGCGCAA TGACTCTTCC GGTCCCGGTC
GATGACCTCC CGGTTGCGGC ATGGAACGGC CCGTTGCCGG CGCCTCAAGC TCCCGTGTCG
GCGGTTTTGG GGCCTAGACA CGGGAGCTAC ACAGTTCGTC GCAGCCGGTA A
 
Protein sequence
MQRLFSFAGI LSFLLILANC GGGSGNTQPP AGGGGGGSTP SFTVSLSIST VNLTPGGATQ 
DVTVSVTGKG GYSGSVSVSA TGLSSGVTVS PTSVSVQTGS SGKLTFSASD SATVGAQSAN
IEAVAGSVKV SSPVQINVAK AARPDRFHSV GGTLWRGFYD DTRGLLFAAN PGLSEVDVIS
GSDFTIKARV SVPQAWSVDQ MADGKTLVIG TVAQEFFTLD EDTLKATIHL LPTLPRISYS
LNCPSVVAMA NGIVFLLTQE MGIAGGGADG AAHLIKWDSK KNTYTELGPL NGASSWSTKS
MVRSADRKWA AFAVDKFYLY SSDDDSITSV VDLATVNPPA DSFGVRGYAL NADGSKIAVA
SASQVTFLDH SFNVLATVPY SSAFQDSGTT VRFTADGNRL IMQNIFPVSL EMVDANSYTA
LGYQPAFGDR ADVYSTIIAI DGVGRAFVGF DGGYEVIDTA QTPVPNPTEA GATLDGPECP
LPNPPNAGLN ASLTYSTFNS SLFAGYSFYF AGAAGTVSAD GTQVTAPASS QAGPVDVECV
DSAGSSRTLP FAFSYGAKAA AVSANLLSPV GEQSLYAFGF GFFSDASSVP AVSVGGLAAA
NVEQVSLSKG SLQGVRLQPP TLSASTTADV TVTSAYGSST VKGAVSYIPS AHVVATSGVL
QLLFDSHRNL LYALKASEID VLDPISLTWN TPFALPAFAG SANYGYIALS PDGSRLVAVA
SAGYAAVVNP DDPSKTFSVS TPNPGFSWGR VVITKENKAV FGGRPPVEID LATSTGKVIP
TYLGWLIASP PDGSVIYGID TGVTTGQAYR TDISTYKTTS TPQFGAQFWS DLAVSADGSH
FAGILAESNG GDVIGFFESG LHLVNFNEGP LLSPADDSLV LGSVFGPKGN VLVVALGDSI
EFWDTQTGTL RARLMTPEEL QTESGSASFA APQVALDSTG QTIFAVSASG ISAMTLPVPV
DDLPVAAWNG PLPAPQAPVS AVLGPRHGSY TVRRSR