Gene Acid345_2446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2446 
Symbol 
ID4072881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2891676 
End bp2892665 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content57% 
IMG OID637984463 
ProductThiJ/PfpI family protein 
Protein accessionYP_591521 
Protein GI94969473 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCG TTCCGCAGGG AGGCAGCGCT GACGCGTTGC CCACCGTGAC CATTCTGTTA 
TTTAACGGCG CTCAACTCAT AGACTTCGCC GGTCCTTGGG AGGTGTTTGG GACAGCCGGA
TTGCTGGTCC ACACGGTGGC CGAAAAGGCA GAGCCACTTA CTGCGGTTTT CGGAGCAAAG
ATCATCCCTG ATTACACCTT CGAAAACAGC CCAAGGACGC ATCTGCTTCT GATTCCTGGA
GGCGGTGTTT TTCAAGAGGC CATTAAGAAT CCGGCCTTGA TTCACTGGAT CCAGACGAAG
GCAACAGAAG CAAAGGTCGT GATGTCCGTT TGCACCGGTG CATTCCTTCT GCAAGCCGCA
GGGTTGCTCG AGGGACATAC CGTGACAACG ACCTACGGAA TGATCGATGA CCTCTCCGGC
CCGAAAACCA AAGTCGTTTA TGACCGGCGG TTCGTGGAAA GCGGCAATCT GATCACCACC
GCGGGATTGT CCTCTGGCAT TGACGGCGCT CTGTATGCTG TGTCTCGGCT TCTCGGCAGC
GGCATAGCGC AAAGCGTGGC ACTGGAAATG GAATACAACT GGGATCCAAC CGGCAACTAT
GCGCGCGCGG CCCTCGCCGA CCGCTTCCTG CCAGACGGTC TCGCGTACGC CAAACCTCGA
ATCAAAGGCG CACAAGCCAA GATGATCTCC ACAGCTGGTG ACCGAGATCA GTGGGAAACG
AAAATTGTCG TGTCACATCC TGAGACGGTG AGCGAAGTTC TCGAACTGAT GCGGGCGCGG
ATCAAGGCAA ACACTGCAAC CGGCGGGATG TTCAAGCCGG TTTCCCACAT CCACGGACCT
CCGCAGGTGA GTGTTGCGGG CGGCGGAAAA TTGACGTGGA AGTTCACCGA CGACGACAGC
CAGCAATGGA GTGGTGAGTG TACGGTCGAA CCCTATGAAC AGCGAGTCGA CCGCCTGCTG
GTGACGATCC GCGTTGCTCG AGCGAAATAG
 
Protein sequence
MTLVPQGGSA DALPTVTILL FNGAQLIDFA GPWEVFGTAG LLVHTVAEKA EPLTAVFGAK 
IIPDYTFENS PRTHLLLIPG GGVFQEAIKN PALIHWIQTK ATEAKVVMSV CTGAFLLQAA
GLLEGHTVTT TYGMIDDLSG PKTKVVYDRR FVESGNLITT AGLSSGIDGA LYAVSRLLGS
GIAQSVALEM EYNWDPTGNY ARAALADRFL PDGLAYAKPR IKGAQAKMIS TAGDRDQWET
KIVVSHPETV SEVLELMRAR IKANTATGGM FKPVSHIHGP PQVSVAGGGK LTWKFTDDDS
QQWSGECTVE PYEQRVDRLL VTIRVARAK