Gene Acid345_1590 details

Gene Information

Locus tag: Acid345_1590
Symbol:
ID: 4069028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1937635
End bp: 1938978
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 60%
IMG OID: 637983599
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_590666
Protein GI: 94968618
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
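
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record: start 1937635, end 1938978.
length = gene_length_bp(1937635, 1938978)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the 1344 bp and 447 aa listed in the record.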


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.632481
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCTTG TTGCCAATCC GACGTTCTTC AAGTTCTTCA TGATGCTCCT GGGGAGCGTT 
GTGATCATCG CCGTCGGAGC CCTGGCGTTT CGCTATCTCA AGCAAATGAT GCTGCAGGAC
GTAGATATGA GCAGCCGGCC TGCCGGTCGC GAAAGCAGCG GGTTCGCCTT CCACACCTAC
CAGGGCGTCA TCTCCAGACT CAAAGAGCAG GAGCAGGAAC TCAAGAGCCT GCGCGAAGCG
GCCAGCAGCC GTGCATCGGC CTCCGAGAGC CTGAGCGTGG CAGTGCTGAC CAACCTCGGC
AGCGGAGTCG TCGTATTCAA TCCGGCAGGC ATTGTGCAGC AGGCGAATCC CGCAGCGCGC
GAGATCCTCG GGTACGCATC ACCCACCGGT CTTCACACGC GCGACCTGAT GAAAGGAGTG
CATGCAGTGC GCACCGAAAC CGGCCAGACC GCAGTCGAGG TGTCGTCGTT CCTGCGCGCC
ATTACCGACG CACCGCAGCA CAACCAGACC ACGAGGTTTG AGGTGGACTA CCGCACGCCG
GGTGGCGTGG AGAAGGTGCT CGCGATTACC GTCTCGCCAA TCCGATCGAG TGTCGGCGCT
TTTCTTGGCT CGACTTGTTT GATCACCGAT CGCACCCAGA TCAGCAGCCT GGCGCGACAG
ATGCGCGTGC GCGAGAACCT TGCGTCGCTG GGCGAGATGT CGGCCGGTAT TGCGCATGAG
TTCAAGAATT CGTTGGCGAC AATCTCTGGC TACGCACAGA TGCTCAAAGG CGAGCCCGAT
GAAACCGTCA GCGAGTTTGC AACCCGCATC CAGGGCACGA CTGAGAACCT TACCAGCGTG
GTAACAGACT TCCTGAATTT CGCACGGCCG CAACAGCTCA AGCGCGAACC GATCGAACTG
CGCCCAGTGC TGGAAGACTG CGCGCGAGAG ACGAAGGTCA CGCTGGAGTT CCAGAACTTT
CCTGACCGGC TCGTTGTGAA TGGCGACCGC ACGGCGCTGC GGCAGGTCTT CAGTAATCTC
TTGCGAAATG CCGCCGAAGC AGCACGGAAC AACACTCCGG TCCGCGTCAC GGTACGAGCA
TCGGCGACCG ACACAAGCGT GGAGTTAAGT CTCCACGACA ACGGAACGGG CATTCCGGAA
GAGGCCCTGA AGAACATCTT CATCCCCTTC TTCACCACCA AGCCACAGGG AACAGGGCTG
GGCTTAGCGC TGGTACATCG GATCGTGACC GAACACGGGG GTTCCATTCG CGCGGGCAAC
GACCTCGAAG GTGCGGTATT CACCTTAAGT CTGCCTTTAC AAAAACCTGC AGCAGAAAAG
CCCGCCACCG CAAACCCAAA GTAA
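
The GC content field can be recomputed from a gene sequence laid out in space-separated blocks like the one above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among the unambiguous bases A, C, G and T. The function name `gc_percent` is hypothetical, and the database may round or define the value differently.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace, line breaks and any other characters are ignored, so the
    sequence can be pasted in its 10-base block layout.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First three blocks of the gene sequence, used only as a short demonstration.
first_blocks = "GTGAACCTTG TTGCCAATCC GACGTTCTTC"
print(round(gc_percent(first_blocks), 1))
```

Passing the full gene sequence instead of the three demonstration blocks gives a figure that can be compared with the GC content reported in the record.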
 
Protein sequence
MNLVANPTFF KFFMMLLGSV VIIAVGALAF RYLKQMMLQD VDMSSRPAGR ESSGFAFHTY 
QGVISRLKEQ EQELKSLREA ASSRASASES LSVAVLTNLG SGVVVFNPAG IVQQANPAAR
EILGYASPTG LHTRDLMKGV HAVRTETGQT AVEVSSFLRA ITDAPQHNQT TRFEVDYRTP
GGVEKVLAIT VSPIRSSVGA FLGSTCLITD RTQISSLARQ MRVRENLASL GEMSAGIAHE
FKNSLATISG YAQMLKGEPD ETVSEFATRI QGTTENLTSV VTDFLNFARP QQLKREPIEL
RPVLEDCARE TKVTLEFQNF PDRLVVNGDR TALRQVFSNL LRNAAEAARN NTPVRVTVRA
SATDTSVELS LHDNGTGIPE EALKNIFIPF FTTKPQGTGL GLALVHRIVT EHGGSIRAGN
DLEGAVFTLS LPLQKPAAEK PATANPK
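
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below is an assumption-laden illustration rather than the database's own method: it builds the standard codon table (which table 11 shares for elongation), reads the first codon as methionine when it is one of the table 11 initiation codons (for example the GTG that opens this gene), and stops at the first stop codon. The name `translate` and its behavior are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of translation table 11; as a first codon they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene; expected to match the start of the protein.
print(translate("GTGAACCTTG TTGCCAATCC GACGTTCTTC"))
```

The first ten residues produced this way are MNLVANPTFF, matching the start of the protein sequence above. Translating the full gene sequence should reproduce all 447 residues if the record is internally consistent.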