Gene Acid345_0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0444 
Symbol 
ID4071691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp521765 
End bp522871 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content59% 
IMG OID637982448 
ProductLacI family transcription regulator 
Protein accessionYP_589523 
Protein GI94967475 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.104683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCAGGA AGAGACGCGG GATTCATCTC ATCGCGGAGA TGGCGCAGGT GTCCATCGGC 
ACGGTGGACC GTGCGCTGCA CGGGCGAAAC GGAATCAGCC ACGCGACGCG CGAACGGATC
CTCCAGATCG CGCGGGAAAT TGGATACACG CCGAACCTTG CTGCTCGCGC TCTCTCGGCA
GGGAAAGCCG GAGTGCGCAT CGGAGTTTGC ATTCCGCGCG AAATCCATTT CTTCTACGAC
CAACTCTGGG GCGGAGTGCT CGAAGAAGCC CGCCGCCTGG AGCATATGGG CGTTGCATTC
GAGTTCCGGC CGGTACGAAA TCTCGGCGAG GGCGATACCG AGGCGCTGCG TGAATTGATC
GAGGACGGCG TGGATGGCGT CATTCTCACC GCAGGAAATC CAGATGGATT GACGCCCCTG
GTGAATGAGG CTGAGGGCCG GAACATTCCT GTCGTCTGTG TATCTACCGA CGCTCCGGAG
AGCCTGCGTT CCAGCATCGT TTGCGTTGAG CCGAGACTCA ATGGCCAGCT TGCCGGCGAG
TTGATGGGAA AGTTCGTGCC CGCAGGATCG AAGGTTGCCG TGGTTGCCGG CATGCTCACT
GCCATGGACC ATCTCAGCAA GACGGAGGGC TTCTCGGTAA CGTTCCCGAA ACACTGCCAT
GGCGGCCAAA TCGTGGGCGT TATCGAGGGC CACGAGGACG AGGACGAAAG CTTCCAGAAG
ACCTTCGATC TACTGGGTAG AGTTCCGGAC TTGGCTGGTC TTTACGTCAA CACCGTGAAC
TGTCTTCCCG TGTGTCGAGC ACTTGGGGCG CGCCAACTCG CAGGGAGAGT CAAACTGATT
ACGACCGACT TGTTTGCGGA GATGGCGACC TATTTCGCCA AGGGCACAAT CACCGCATCG
ATCTACCAGC AACCCCACCG ACAAGGCCAA CTGGCGGTCA GATTACTCGC CGACAACCTC
ACGGCAAACC AGCCATTTCC GCCTACTGTG CACTTAAGTC CTGGGGTTGT CATGTCTTCG
AATTTGCACC TTTTCCGCGA GATGCGTCGC AGTGAAACGA AGCTTCCGGA CGTGGTGCGC
GTGGCCTCTC TCGCCACGAA GGTGTAG
 
Protein sequence
MTRKRRGIHL IAEMAQVSIG TVDRALHGRN GISHATRERI LQIAREIGYT PNLAARALSA 
GKAGVRIGVC IPREIHFFYD QLWGGVLEEA RRLEHMGVAF EFRPVRNLGE GDTEALRELI
EDGVDGVILT AGNPDGLTPL VNEAEGRNIP VVCVSTDAPE SLRSSIVCVE PRLNGQLAGE
LMGKFVPAGS KVAVVAGMLT AMDHLSKTEG FSVTFPKHCH GGQIVGVIEG HEDEDESFQK
TFDLLGRVPD LAGLYVNTVN CLPVCRALGA RQLAGRVKLI TTDLFAEMAT YFAKGTITAS
IYQQPHRQGQ LAVRLLADNL TANQPFPPTV HLSPGVVMSS NLHLFREMRR SETKLPDVVR
VASLATKV