Gene Daro_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1854 
Symbol 
ID3570102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1993479 
End bp1994801 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content62% 
IMG OID637680325 
Productserine/threonine protein kinase 
Protein accessionYP_285070 
Protein GI71907483 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.13717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATGA ACACCCTGGA AGCGCTGCGC GCCGGCAAGC TGGCCGGCAG CCGTCGCCTG 
AAACTGGCCT GTGGCCTGAG CGAGTTCCCT CGCGAGATTT TCGCTCTGGC GGATACGCTG
GAGATTCTCG ATTTATCAGG GAATGCACTG TCCACGCTGC CCGACGATCT GCCCCGGCTG
CAGAAGATGC GCATTCTTTT TTGTTCGGAC AACCATTTCA CGGAATTGCC GGCCGTGATC
GGCCAATGCG CCCAGCTGGA AATGGTCGGC TTCAAGGCGA ACCGGATTCG GCAGGTGCCG
GCGGCAGCAT TGCCGCCCAA GTTACGCTGG CTGATCCTGA CCGATAACCG GATTGCCGAA
CTGCCGCCGG AAATCGGTCA CTGCTCGCGC TTGCAGAAGC TGATGCTGGC CGGCAACCAG
TTGCGCACAC TGCCGGGCGA GATGGCCAAT TGCACCCGGC TGGAGCTGGT ACGCATCGCG
GCCAATCGCC TGACAGCCTT GCCGGAATGG CTATTGTCCT TGCCACGACT TTCCTGGCTG
GCCTTTGCCG GCAATCCGTT TTGTCAGGAG GTACCGGTTC AAGTGGGGGG GGCGCCCATT
CACTGGGACG ATCTGCACGT CAGCCATCAA CTGGGCGAGG GCGCTTCAGG CGTCATTCAT
CAGGCCGAAT GGCGTTCGGC CGGTAAAGCG CAACCGGTGG CGCTCAAACT GTTCAAGGGC
GTGGTGACGA GCGATGGCTT GCCGCTGAAT GAGATGAACG CCTGCATCTC CGCCGGTACG
CACCCGCATC TGATTTCGGT ACTCGGCAAG CTGGCGGCCC ACCCGGAAGA TGCCCACGGG
CTGGTCATGG CGCTGATCGA CAGCAGCTTC CGCAACCTGG CCGGGCCGCC CAGTCTGGAT
TCCTGCACGC GCGATGTTTA TCCGGCAGCC TTGAGCTTCG ATCTCGATGC GGCCATTCAA
ATCGCGCTCG GCATCGCCTC GGCGGCCGAA CACCTGCACG CGCAGGGCAT CATGCATGGC
GATCTCTATG GCCACAATAT CCTGCATGGT GCCCATGGGC GGGCCATCCT CGGCGACTTT
GGCGCGGCGT CGTTCGTGCC GCCGGATGAT CCGCTCGTTG CCAAGGCCCT GCAACGCATC
GAGGTGCGGG CGTTTTCCTG TTTGCTGGAG GAGTTGCTGG AGCGGATTGG ACCTGCCGCG
CTCAACCCGG AAAAAGTCGA AAAACTGAAA TCCCTGCTGG CCGACTGCGC CCAGGAAAAC
GTCTCGGCTC GCCCGCTGTT CGCCGAGATC GTTTCGCGGC TCGGTGCCTT AAAACTGGCC
TGA
 
Protein sequence
MPMNTLEALR AGKLAGSRRL KLACGLSEFP REIFALADTL EILDLSGNAL STLPDDLPRL 
QKMRILFCSD NHFTELPAVI GQCAQLEMVG FKANRIRQVP AAALPPKLRW LILTDNRIAE
LPPEIGHCSR LQKLMLAGNQ LRTLPGEMAN CTRLELVRIA ANRLTALPEW LLSLPRLSWL
AFAGNPFCQE VPVQVGGAPI HWDDLHVSHQ LGEGASGVIH QAEWRSAGKA QPVALKLFKG
VVTSDGLPLN EMNACISAGT HPHLISVLGK LAAHPEDAHG LVMALIDSSF RNLAGPPSLD
SCTRDVYPAA LSFDLDAAIQ IALGIASAAE HLHAQGIMHG DLYGHNILHG AHGRAILGDF
GAASFVPPDD PLVAKALQRI EVRAFSCLLE ELLERIGPAA LNPEKVEKLK SLLADCAQEN
VSARPLFAEI VSRLGALKLA