Gene Acid345_1909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1909 
Symbol 
ID4069387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2293371 
End bp2295257 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content59% 
IMG OID637983920 
Producthypothetical protein 
Protein accessionYP_590984 
Protein GI94968936 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.383493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13949 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC ACCTAGTCGG CTTTTGTCTT GCCGCGTCAT TGTGTGGGCT GGCAGTTGCG 
CAGAATGCAG CCCCCGCGAC CTCGTCCCAA CAGGAGGTCG TGCCACGATT GATCCGCTTC
TCAGGTCAAC TGAAAGACGA TGCCGGCAAA GCGATGTCAG GCACCGTCGG GATTACGTTC
ACACTTCATA AAACCCAAAA TGACAATGCC GCGCTATGGA TCGAGACGCA GAACGTGAAA
CTCGATAGCG AAGGCAAATA CACCGTATTA CTTGGTTCTA CCAAATCCAC AGGAGTTGCG
GCGGAGTTGT TTATCAGCGG CGAAGCGCAG TGGCTGGGCG TTCGCGTGGA AGGGAAGGCG
GAACAACCGC GGGTGTTGAT GGTGAGCGTG CCTTACGCGT TGAAGGCGAA GGAAGCAGAG
ACGTTGGCGG GACATTCCGC GACAGAGTTC GTGACGTCCG ACAAGCTTAC GACCGTAGTG
AAAGAGCAGA TGCAGCAGAA CGGCGCGACG GTGAAGAAGG ACGGAACCGG CGTTAAAGGA
AATGTGCTTT CCTCGACCGC GACGAATTTC ACCGACACTA CGACCAACCA GGTTGTTCTG
GTGACGCAGA AAGGAACGGG CGCGGCGCTT TCTACCACGA GTCCGGGCGC CAGCATCACG
GCGTACAGCA CAGGGGCCAA CACGGCGGCG AACGCGCTGT ATGCGCAGGC AGTGGCGCCG
AGTGGGAATG CTGTGTACGG AAACGAAACC GCGGCTACTG GAACAGGCGC TGGAGTACTT
GGCCGCTCAC TTTCGGTGTC CGGGTACGGT GTGTATGGCG TGAACGCTGC GACGACAGGA
ACGGCGGTCG GCATTCGCGG AACGTCGGCA TCGAGCGGCG GCATCGGCGT GTATGGAACT
GAGACGGCGA CAACGGGAAC CGCAACGGGC ATCTACGGCA CAACGGCATC GCCAGCCGGC
TTCGGGATGG AAGGCATCAA CCTGGCGACG ACGGGGTCGG CGGTGGGCGG CTTCGCGCGT
TCTGCTTCGA CGTCAGGTGT GGGCCTTCGC GGTTACGCGA GTTCTGCGAC CGGCACGACG
ACCGGTGTGT TGGCACAGGT CGCGAGCTCT TCGGGAACCG CAGCAATACT GCAAAACACA
GCCAGCGGAG CTCTTCTCCA AGGCCAATCT GGTTCTGGAT TAACGACCGT GTTCTCGATC
GACGGCAGTG GAAATTTCAA CGCGAACGGT GAATTCTTCT CACCGAATTC GCTTGTGAAT
GTCGGGTGGG GATACTTCGC GCCGACACCT TTGAACTTCT ACGCCATGGA GGCAATCAGC
ACGAGCAATG CCTCCGGACA TCCAACCGCG ATCATCAAGA ACAACGACAA CACGAATGCC
GGCAGCCAGG CGTTGGAAGT GGACGGGCCG GGATTTGGCG GCCAGTGCAC GATTGACGTG
AGTGGAAACC TGTTCTGCAC CGGAACACTC GCTCCCGTGA TAGCCGGTCC GAACGGCCAG
AAGACGGCGC TGTACACCAT GCAAACTACC GAAAACCTGA TGGAAGATTT CGGTCGGGGA
GCTTTGGTGA ATGGCGTCGC GAAGGTCACG ATCGATCCTA AGTTTGCGAC TGCAGTCAAT
GCAGGCGACT ACCACGTGTA CGTAACCCCG GGCGGAGATT GCGAAGGCCT CTTCATCACC
AACCGCACGG CAACGTCTTT TGAAGTGCAC GAACTCCGCG GTGGCAAGTC GGCGATCAGC
TTCGATTATC GGATCGTGGC GCACCGCAAG GGCTTTGAAA CCGACCGCAT GCCGGACGTA
ACGCAACGGC TGGCGCGGAA GTCCGAAATT GAGGGACAAA CGGAAGCCAG AACCAGGGCG
CTGCCGCCGG CAGAGCCACA GAAGTAG
 
Protein sequence
MKKHLVGFCL AASLCGLAVA QNAAPATSSQ QEVVPRLIRF SGQLKDDAGK AMSGTVGITF 
TLHKTQNDNA ALWIETQNVK LDSEGKYTVL LGSTKSTGVA AELFISGEAQ WLGVRVEGKA
EQPRVLMVSV PYALKAKEAE TLAGHSATEF VTSDKLTTVV KEQMQQNGAT VKKDGTGVKG
NVLSSTATNF TDTTTNQVVL VTQKGTGAAL STTSPGASIT AYSTGANTAA NALYAQAVAP
SGNAVYGNET AATGTGAGVL GRSLSVSGYG VYGVNAATTG TAVGIRGTSA SSGGIGVYGT
ETATTGTATG IYGTTASPAG FGMEGINLAT TGSAVGGFAR SASTSGVGLR GYASSATGTT
TGVLAQVASS SGTAAILQNT ASGALLQGQS GSGLTTVFSI DGSGNFNANG EFFSPNSLVN
VGWGYFAPTP LNFYAMEAIS TSNASGHPTA IIKNNDNTNA GSQALEVDGP GFGGQCTIDV
SGNLFCTGTL APVIAGPNGQ KTALYTMQTT ENLMEDFGRG ALVNGVAKVT IDPKFATAVN
AGDYHVYVTP GGDCEGLFIT NRTATSFEVH ELRGGKSAIS FDYRIVAHRK GFETDRMPDV
TQRLARKSEI EGQTEARTRA LPPAEPQK