Gene Acid345_0332 details


Gene Information

Locus tag: Acid345_0332
Symbol:
ID: 4070094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 360199
End bp: 363399
Gene Length: 3201 bp
Protein Length: 1066 aa
Translation table: 11
GC content: 61%
IMG OID: 637982335
Product: hypothetical protein
Protein accession: YP_589411
Protein GI: 94967363
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
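
The start, end, and length values in the gene information above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(360199, 363399)
print(length)                  # 3201
print(protein_length(length))  # 1066
```

Both results match the listed Gene Length (3201 bp) and Protein Length (1066 aa).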


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.635539
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.188502
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGCT CGAAAGTTGA GCTTCAACTT TGTCTCGCGA TCGTCCTGTG CTTGCTTGCA 
TGCACTCCTC TGTTCGCCGC TGCGCCAACG TTGAACTCCA TCTCTCCAAA GTCCGCACCG
CTGAATACCG CGGTCACGCT CCAGTTGGTC GGAGCGAATT TCGCCTCGAA CTCGCAGGTC
TATTTCAACG GCAACGCCGT TCCGACGACC TTTAGCAGCA CTACGGTGCT GCAAGCTTCC
GTCCCCGCTG CGAGTGTGGC CACTCCGGGG AATTTTGCAG TGACGGTGAC GACGCCCTCC
ATGGGCACCA GCGCGGCGTT GATGTTCACC TCTTATGTCG CGCTGCCCAA CAACAGCATG
GCGTATAGCG CCGCGACAGG TCAGTTGTAC GTGTCGGTAC CGAGCACCGC GGGCATGCCC
TACGGCAATT CGGTGGTGGC GATCGACCCT GTGACCGGCG CGATCACGAA GTCGATCTTC
GTCGGCAGCG AGCCTAACAA GATGGCAGTG AGCGCCGATG GCACCGTGCT CTGGGTAGGA
CTCGACGGCA GCTCCGCTGT GCGCCAGGTT AGTCTGACCG CCGGCACCGC CGGGGCAAAG
ATTACGCTGG GCTCAAACAC CGGTACGAAT GCACCGCCGG TTGCGCTGTC TCTAGCGGCA
CTGCCGGGAT CGCCGAATTC GTTCGTGGTT TCCATGACCG CCCCGTTGGG CGGAACGGTT
GTCGCGATTT ACGACAACGC GACGCGCCGC GCGAATACGT GGAGTGCCTC GACTTACTCT
GGAAATGCAT TGCAGACGAA TGCAACGACC TCTGAGGTGT ATGTCGGCGG TCCGACCTAC
TACCAACCTC TTTCTTATAG CGCGACCGGA TTGAGCGTTC CGAGGCTCGG TTCCTCGGGC
AACTTCACCG GCAGCACAGA CGACCTGCAG GTTGTGAATG GCGAAGTCTA TACCGATCTT
GGCGCGCTCT ACGACGCGGA GACCGGTGCG CGGAATGGTT CGCTTCTGAA TGGTTCGAAC
CTCGCCGCGG GTCCCACTTT CACCGACACG CCGCTCGGCA AGACGTTTGT GTTCGACAGC
CCGACCGCGA ACAAGTACAC GCAGGTGCAG GTTTTCACCA CGAGCACTTC GGCGTTGGCT
GCAACCTTCC CGTTGAACCT CGCTTCGAAC ACCACCGGCA CGCCGTCGCA CTTGTTACGC
TGGGGCACAA ATGGCCTGGC AGTGCGCGAC AATGTGGCGA TCTATGCGTT CCGCTCTGCG
CAGGTCACGA ACCTCGCGGG GATCAATGCA GACCTCAGCG TCACCCTCGC GCAGAGTGGC
ACGCCGACCA CCGGTAATTC CATCACCTAC ACCGCGACGG TGAAGAACGC CGGACCGGCC
ACCTCTACGA ATGTCGCCTT CACGGCCCAG GCTCCTGCGA CAGCGAGCAT CGTTTCCATT
ACGCCGACCG TCGGCTCGTG TTCCAAGCTG AACGGCCTGA GCTGCAATCT CGGCAGTCTC
GCGACCGGGG CAACCACGAC AGTTACGGTT GTGGCAAAAC AGATGCTCGC CGGGTCCGTG
GTGCTCAACG CGCAAGTCTT TGGTTCGGAG AACGACCCCA ACCTGGCAAA CAACCAGGCT
TCGACGTCAG TTCTCACCAT CACCGGTAAC CCTTATAACG GGGTTCCGAC GATCACCTCG
ATTTCCCCCG CTGCCATTCA AGCGGGCTCG GGCACTACCA TGGTCACGGT TACCGGAACG
GGCTTCTCGA CGGCAGCGTC GATCTTGATT GACGGCACTG CACTCGCGAC GACCGTCCTC
AGCAGCACGC AAGCCACAGC TATGGTTCCT TCGACCAAGT TGGCGAGCCT GGGTTGGAGC
AAGATCAACG TTTCCAATCC AGCGCCGGGC GGCGGAGTTT CCTACGTGCT GCCACTTTCA
GTATTCAAAG TGCTGAGCGC AGGTGCGAAT CACATCGTCT ATGAGCCGTT CAGCCGCAAG
CTGATTGCCA GCATTGGCGC CGGCGGCAGC GGATTCACTG CGAATTCGGT GACAACGATC
ATCCCCGACA CGGCGACGGT TGGCACGACC ATGTTGCTCG GCGCCGCGCC GACCAGCCTG
GCGGTCACAT CCGATGGACA GGCTCTGTAC GCGACCTTGC CGAGTGTGCC GAGCGTGGCG
CGCTTCAACC TGCTCGCACA GAAGCTCGAC TTCACGTACA CGGTGCCGAA GGGTTCATCC
TTCACTGGCA CGATTAACCT CCGCGGCGTT TCTACGCAAC CGGGGAATGT GAACACCGTC
GCTCTGGATC TTGGCGCGAG CAACGGCATC GGCATTTACG ACTTCAACTC CACGACGAAG
ACGGCTGCGT TGCGTGGAAG CAATACCGGC AACTACACCG GTTCCTGCGT TCGTTATTCC
GATTCGACGA ACCTGATGGC GTTTGACTCG GACAGCAACC TGACGTTCAA CCACTTCGCA
GTGCCCGCGG CCGGGTTTGC CTATAGCAAT CCGACGCAGT ACAGCACCTG GTCGCTGGCC
AGCTTCAATT GCTTCCAGAT GAACGGCGGC TATGCCTTCG CGAACAAGGG CGGAGCGGCC
ATCCCGGTAT CCGCGGCGAC GACGGAGGTC GGCGTCTTCA AGCCGATCCC GAACGTCACG
ACCTCGACCA TGCAGGTTGT GGCGCCGGAC GTTTCGCTGC ACGTGGTCTT CTATCTGGCG
CAGACACATT CACTTTCCAG CACGAGCGCG GTAGACGGCT TGGTGACCTA CAACCAGACG
ACGTACATGC CGAACACGAC GATCCCGATG GGCCTGGACC TGATCGAAAA CACGACGTCG
TTCGGTGGCG TGGATTTAGT GCGCTGGGGG CAGGATGGCT TGGCCGCGTT GACTAGCACC
GGTAAGATCT ACTTGCTGCG CGGCGGCGCC GTGGTTCCGC AACTGCTTTC GACGCGCACG
GCGGCAACGC TGACGTCGGC ATCGGTTACG TCGGTGACAC ATGGTTCGGG CAACCTGTTG
ATCAGTGTGG TGGGCACGAA CTTCCAGAGC GGTATGGTGC TGACCTGGAA TGGCAACTAT
CGCACGACAA ACGTGACGGA CGCGACGCAT GCAACGGTTG CGATTCCGGC ATCGGATTTT
GCGAGCATCG GAGCCGGAAC GATTACGGCA GTGAACGCGG GGGCTCCGGC TTCCTCGGGC
CTGTCCATCA CCATCAACTA G
 
Protein sequence
MKSSKVELQL CLAIVLCLLA CTPLFAAAPT LNSISPKSAP LNTAVTLQLV GANFASNSQV 
YFNGNAVPTT FSSTTVLQAS VPAASVATPG NFAVTVTTPS MGTSAALMFT SYVALPNNSM
AYSAATGQLY VSVPSTAGMP YGNSVVAIDP VTGAITKSIF VGSEPNKMAV SADGTVLWVG
LDGSSAVRQV SLTAGTAGAK ITLGSNTGTN APPVALSLAA LPGSPNSFVV SMTAPLGGTV
VAIYDNATRR ANTWSASTYS GNALQTNATT SEVYVGGPTY YQPLSYSATG LSVPRLGSSG
NFTGSTDDLQ VVNGEVYTDL GALYDAETGA RNGSLLNGSN LAAGPTFTDT PLGKTFVFDS
PTANKYTQVQ VFTTSTSALA ATFPLNLASN TTGTPSHLLR WGTNGLAVRD NVAIYAFRSA
QVTNLAGINA DLSVTLAQSG TPTTGNSITY TATVKNAGPA TSTNVAFTAQ APATASIVSI
TPTVGSCSKL NGLSCNLGSL ATGATTTVTV VAKQMLAGSV VLNAQVFGSE NDPNLANNQA
STSVLTITGN PYNGVPTITS ISPAAIQAGS GTTMVTVTGT GFSTAASILI DGTALATTVL
SSTQATAMVP STKLASLGWS KINVSNPAPG GGVSYVLPLS VFKVLSAGAN HIVYEPFSRK
LIASIGAGGS GFTANSVTTI IPDTATVGTT MLLGAAPTSL AVTSDGQALY ATLPSVPSVA
RFNLLAQKLD FTYTVPKGSS FTGTINLRGV STQPGNVNTV ALDLGASNGI GIYDFNSTTK
TAALRGSNTG NYTGSCVRYS DSTNLMAFDS DSNLTFNHFA VPAAGFAYSN PTQYSTWSLA
SFNCFQMNGG YAFANKGGAA IPVSAATTEV GVFKPIPNVT TSTMQVVAPD VSLHVVFYLA
QTHSLSSTSA VDGLVTYNQT TYMPNTTIPM GLDLIENTTS FGGVDLVRWG QDGLAALTST
GKIYLLRGGA VVPQLLSTRT AATLTSASVT SVTHGSGNLL ISVVGTNFQS GMVLTWNGNY
RTTNVTDATH ATVAIPASDF ASIGAGTITA VNAGAPASSG LSITIN
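
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step, with a basic sanity check for a coding sequence and a GC fraction helper. The function names are hypothetical, the check only accepts ATG as a start codon (translation table 11 also permits alternative starts), and the input shown is a toy string, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """True if seq is a whole number of codons, starts with ATG, and ends in a stop."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


toy = join_blocks("ATGAAA AGC\nTAG")  # toy input, not the real gene
print(toy)                  # ATGAAAAGCTAG
print(looks_like_cds(toy))  # True
```

Pasting the full gene sequence into `join_blocks` and passing the result to `gc_fraction` would give a value to compare against the GC content listed in the gene information.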