Gene Acid345_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0072 
Symbol 
ID4068985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp72508 
End bp75312 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content63% 
IMG OID637982072 
Productresponse regulator receiver protein 
Protein accessionYP_589151 
Protein GI94967103 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTTTAA AGATTCTGCT GGCCGACGAC AGTATGACTG CCCAAAACAT GGGCAAGAAA 
ATATTGACCG ACGCCGGCTA TGAGGTTGTG GCCGTAAGCA ATGGCGCAGC CGCTGTTAAA
AAAATCGCGG AAATTAATCC CCAGCTCTGC ATCCTCGATA TCTACATGCC GGGCTACACC
GGCTTGGAAG TTTGCGAAAA GATTAAGTCC TCGAGCGTTA CGGCCCGCAT CCCCGTCCTG
TTGACGGTGG GCAAAATGGA GCCCTACAAG CCTGAGGAAG GCGCCCGGGT ACGCGCCGAC
GGTGTCATCG TCAAACCCTT TGAGGCCAGC GAGTTACTGC GCGCTGTGCA CCGATTTTCG
CAGCGAATCG CAAACGTAGC CCAGCCCGGA CGACCCGGAG CACCAGCGCA ACCCGCGTCG
CCACAAACAC CGCCGCCACC GCCGGCTTCC ACGCCAGCGA CCTACGAGCG GACGGTGAAG
CTGACCCGCG AACAGATCCA GGAATACATG GAGCCCGGCC TGCAGGGATG GCGAGGCGGT
ACTTCTTCTC CCGAGCACGC CGGCGCGCAC GCGCGCACCG AAGCGCCGGA CATGCCGAAA
TCGCCTGCAT ACCCGGTGGA AGGCGTGGTC GCCAGCATGA AGACTGGGTC CACTTCGGAC
GCCATGCCGG CGTTTACGAT ATCGGGCGCG GTGGCTGAGG CAGAAACCTC CACCGTCGTC
GCGGAAACAT CGGCTGCCGT CGCCGAACTC GAGACGCCCG CTGTTCCCGT GGAAACTGTC
GCGATGGAAA CCGTGGCAAT GCCGGTCGAA TCCGCGCTCG CCGCTACTGA GATCGCGCCC
GTCACAGACG TTGCCATCGT GACAGAAGAA CCTGCGCCGG TCTCCACCGA AGCCGCGGTT
TCGGTCGCGG AAACCGCGCC GCCCGTGCCA GAAATCGCCG CGCCCGCAAT AGAGCCAGAA
GCACCCACAG AAGCGCCGAT TGCAAGCGCA TCGTCCGTGC TTCCGGGTTT CGTAGACTAC
CTGATCGCGC ACTCGAGCGT AGAAACTGCG CCGGAAACCG TGGTTCCCGC GGAGATTGCG
CCGGAAACAC CTGTTGCGGT CGCGGAAGCC GCACCGGAAA CCGTCGCTGC CCTAGAAACA
GAAACACCCG TGGTTCCCAC GCCCGAAGCG GCGCCCGTCG CCCTGGAGAG CAACGAATTC
GCGGTGAGTG CGCCTGCAGA ACCGGCGCCT CTCGCGGTTG AGCCTGCGCC CGTCGCGGAA
ATTTCCGTCG ATCCCGCGCT GCAGCAGACC GCAGATGGCG TGCTTCCGAC GGGAGCAGGA
TTCCTCTCTG ACGATATCCG CGCCGCCCAC GACATTGGCC ACAAGATCCC CGCCGTCGAT
CCCGCGCTCG AGATCACGCC CGAAGTCGCA ATCACCGCGA CACCGGACCC GCACCTGGAA
TCGAACGAGC ATATCCACGT AGCTAACGCG ACCGAAGATG GCTTGGTTCC GACCTCGCGC
TCGGCGGAAT CCGAAGGCGT GGTCACCACC GAAGATCCGA ACCTCTCCCC GATGGAAGGC
CTGACCGATG CAGTCCTGCC ACTCGAAGGC GGACGGCTCA TCATCGAAGA ACCTGCGCAC
GTCGCGGAAC CATCGGAGCC CGAAGCAGTA GCGGAAGCGA TTGCTACCGA GACCGCTGTC
GAAACTCCAG CGGAGACGGT TGCGACGGAA GCAGTCACTA CCGACACACC AGCCGTAGAA
GCAGTTGCTA CCGAGGCTGC TCCTGGCGAG ACACCTGTTG TTCAAGAAGC GGTCGTCGAG
ACGTCTGCAG TCGAAGCACC CGTAGCCGAA ACGCCGGCCG TAGTTGAGAC GCCATCCGTC
GAGTCCACTC CTTCCAAGTC GGGAAAGAAG GCGCGCAAGG GAATGCGCCC GATTCCCGCT
ACCACATCTG GAACCGAGAC TCCGGCAGAA GCATCAGCCC AAACGGCCGA GAACACCACT
GCTCCGGCAG AAACCGCTGC GGTCGCCGAA GCTGCGACAG TCGCGCCGGA AACGCCAGCC
CCGCCAGTCG AAGAAAAGAA GGAAGCTCCT TCGGCGAAGG CGGATGTCAC GGAAGAGATT
GCGGCCGTGC TCGACTTGCT CGGCCAAACC ACCACGTCTT TGAGCCCTTC GGCTCCGGGA
GCTGGTGTAG CGGCGGCCAG TGCCGCCGTG GAAGAAGCGC CCGCCGTCGT CATTGGCGCG
GTGCGCGTTT GGATGGCGGA AGAAGTAGCG CTGTCCGAAG CAGAAACAAC CATATCGCTC
GAGAACGAAA TGCGCATTGC TCAAGCGCCG AAGGAAGTTG TGCCCGAAGC TGTCGTCGAG
CCGGTTGCAG TTGCTGAACC GTCCGTGACT GCGGAAGTGG CGCCACAACC TGAGGCTGCA
ACAGCAGCGC CTGCGGAGGA AACGGTGCGG GATGACAAGA ACGCGCCCCA ACCCGCGGTG
ACCCCGGACA CGGGAGTTCC ACCTCCCGAT GCCGGAGCAT CGCCCACCCA AGAACTGGCG
GCAGCCATGG CTGCCGCGTT CGGCGGAGCC TTGCCCCAAG AGGACCTGGC TCGTGCCGAA
GAGACGCAGT CGATTTCGGA AGAGGTTCGC CGCTTTGCCA TGGGCGACGC CTATAAACCG
AAACCGGCTG TCGGTTTCGG CGGCGCCGCC ATTGAAGATC CAAGCGAGAA GAGCATCCCC
TCGCAGGACA AGCTGGCCGC AGCCATCAGC CGAGCGCTCG ATCGCCTGAA GCCACAGATC
GTGGCCGAGA TCCTCAAAGA ACTCGAAGCC GGCGAAGAGA AGTAG
 
Protein sequence
MALKILLADD SMTAQNMGKK ILTDAGYEVV AVSNGAAAVK KIAEINPQLC ILDIYMPGYT 
GLEVCEKIKS SSVTARIPVL LTVGKMEPYK PEEGARVRAD GVIVKPFEAS ELLRAVHRFS
QRIANVAQPG RPGAPAQPAS PQTPPPPPAS TPATYERTVK LTREQIQEYM EPGLQGWRGG
TSSPEHAGAH ARTEAPDMPK SPAYPVEGVV ASMKTGSTSD AMPAFTISGA VAEAETSTVV
AETSAAVAEL ETPAVPVETV AMETVAMPVE SALAATEIAP VTDVAIVTEE PAPVSTEAAV
SVAETAPPVP EIAAPAIEPE APTEAPIASA SSVLPGFVDY LIAHSSVETA PETVVPAEIA
PETPVAVAEA APETVAALET ETPVVPTPEA APVALESNEF AVSAPAEPAP LAVEPAPVAE
ISVDPALQQT ADGVLPTGAG FLSDDIRAAH DIGHKIPAVD PALEITPEVA ITATPDPHLE
SNEHIHVANA TEDGLVPTSR SAESEGVVTT EDPNLSPMEG LTDAVLPLEG GRLIIEEPAH
VAEPSEPEAV AEAIATETAV ETPAETVATE AVTTDTPAVE AVATEAAPGE TPVVQEAVVE
TSAVEAPVAE TPAVVETPSV ESTPSKSGKK ARKGMRPIPA TTSGTETPAE ASAQTAENTT
APAETAAVAE AATVAPETPA PPVEEKKEAP SAKADVTEEI AAVLDLLGQT TTSLSPSAPG
AGVAAASAAV EEAPAVVIGA VRVWMAEEVA LSEAETTISL ENEMRIAQAP KEVVPEAVVE
PVAVAEPSVT AEVAPQPEAA TAAPAEETVR DDKNAPQPAV TPDTGVPPPD AGASPTQELA
AAMAAAFGGA LPQEDLARAE ETQSISEEVR RFAMGDAYKP KPAVGFGGAA IEDPSEKSIP
SQDKLAAAIS RALDRLKPQI VAEILKELEA GEEK