Gene Daro_1130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1130 
Symbol 
ID3570148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1237096 
End bp1238436 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content62% 
IMG OID637679597 
Producthypothetical protein 
Protein accessionYP_284356 
Protein GI71906769 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATG ATTCTCAGGG AACCCCGGAG GAAAATTCCG CCCCGGTTCC CCACGGACGC 
TGGTCGCGCC TGGCCCGTCT GGGCTCATTG GCCGGCGGCG TGGCCGGCAA TATGCTGGCC
GAGGGCGCCC GCCAGTTTGC ACAAGGCAAG CGGCCAAAAA TCCAGCAACT ACTGCTGACG
CCGGCCAATG CCCGCCGGGT GGCCGATCAA CTGGCGCAAC TACGCGGTGC GGCCATGAAA
GTGGGGCAGC TGCTATCGAT GGATGCCGGC GAACTGCTGC CGCCCGAACT GGCCGACATT
CTTGCCCGAC TGCGCGCCGA CGCCATCCCG ATGCCGATGA GCCAGGTGGT CAAGGTGCTC
AATACCAACT GGGGCGAAGG CTGGGATCGC CATTTCGAGC GTTTCTCCTT CACCCCGATG
GCAGCAGCCT CGATCGGCCA GGTGCATTTC GGGCAAAGGA AAGATGGCCG CCACTTGGCG
ATCAAGATCC AGTACCCCGG CGTTCGCCAG AGCATCGACA GCGATGTCGA CAATGTGGCC
ACCCTGCTCC GTGTTTCCGG CCTGCTGCCC AAAACGCTGG ACGTCAAACC ACTACTGGAA
GAAGCCAAAA AGCAATTGCA TGACGAAGCC GATTACCGTC GAGAGGGCGC CTGCATGATG
CAGTTTGCCG GCCTGCTGGC CGATGCCGAC GAGTTCATGG TGCCGGAAAT GCACGACGAT
CTGACCACGG AAAACATCCT GGCGATGACC CGCCTGGATG GCGTGGCCGT CGAGTCCCTG
AGCCATGCCC CGCAGGCGGA GCGGGACCGC ATCATCAGCC AGTTATTCAG GCTGCTGTTT
CGCGAGATTT TCGAATTCCG GCTAATCCAG ACTGACCCGA ACTTCGCCAA TTACCGATAC
GCCGCCGCGT CACAGCAGCT CATGCTGCTC GACTTCGGCG CTACCCGGGT GTACCCCGCG
GCCATGATCG ACAGCTATCG CCACCTGATG CTCAGCGCCA TTGCCGATGA TCGTTCGGCG
ATGAACCAGG CGGCCCAGGC GATCGGCTAT TTTCAGAGCG ATATTAAGGA GGGGCAGCGC
CAGGCTGTGC TGGATATTTT CGCGCTGGCC TGCGAGCCGC TGCGACAGGC AGGAGAATAC
GATTTCGGCA GTTCCGATCT GGCGCTGCGA ATTCGTGACG CGAGCATGGT ACTCGGCATG
GATCGGGATT TCTGGCACAC CCCACCGGCG GATGCGCTCT TTCTGCATCG CAAGCTGGGC
GGCCTGTATC TGCTGGCAGC AAGGCTCAAG GCGCGGGCGA ACTTGCATGA AATCGCTGCC
CGCCACTTGC TGGCCGGCTA G
 
Protein sequence
MSDDSQGTPE ENSAPVPHGR WSRLARLGSL AGGVAGNMLA EGARQFAQGK RPKIQQLLLT 
PANARRVADQ LAQLRGAAMK VGQLLSMDAG ELLPPELADI LARLRADAIP MPMSQVVKVL
NTNWGEGWDR HFERFSFTPM AAASIGQVHF GQRKDGRHLA IKIQYPGVRQ SIDSDVDNVA
TLLRVSGLLP KTLDVKPLLE EAKKQLHDEA DYRREGACMM QFAGLLADAD EFMVPEMHDD
LTTENILAMT RLDGVAVESL SHAPQAERDR IISQLFRLLF REIFEFRLIQ TDPNFANYRY
AAASQQLMLL DFGATRVYPA AMIDSYRHLM LSAIADDRSA MNQAAQAIGY FQSDIKEGQR
QAVLDIFALA CEPLRQAGEY DFGSSDLALR IRDASMVLGM DRDFWHTPPA DALFLHRKLG
GLYLLAARLK ARANLHEIAA RHLLAG