Gene Daro_2244 details

Gene Information

Locus tag: Daro_2244
Symbol:
ID: 3566431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2426675
End bp: 2427895
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 637680711
Product: hypothetical protein
Protein accession: YP_285451
Protein GI: 71907864
COG category:
COG ID:
TIGRFAM ID:
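
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon excluded from Protein Length (the gene sequence in this record ends in TGA). The helper names `gene_length` and `protein_length` are hypothetical and introduced only for illustration.

```python
# Values copied from the "Start bp" and "End bp" fields of this record.
START_BP = 2426675
END_BP = 2427895


def gene_length(start: int, end: int) -> int:
    """Return the feature length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Return the residue count for an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the last
    codon is a stop codon, which is not counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(f"Gene length: {length_bp} bp")
    print(f"Protein length: {protein_length(length_bp)} aa")
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.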


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.311601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTCCC AACGCTTCGC CATCAAGGGC CGCGTGATCG GCGCGGATGA TTCGCAACTG 
CAGGATGCGC TCGCCCACGT CTACGAGACA CCCGAGCGGC CGCGCTGCCT GTGCGTGCCT
GGCGGCGTGG AGATGTACGT GGCGCGGCAT CGCCAGTTCG TGGTCAAGCG CATGCCGGAC
AGCGGCAGCA CCCACCATCC GAGCTGCCCT TCCTACGAGC CGGAGGTCCA GCAGTCCGGA
CTGGGCGAAC TGGTCGGCGA AGCGGTGCTG GAGTCGGAAC CGGGGCGGTT CGAATTGCGT
GTGGATTTCC CCTGGATGCG GGTAATCGGG CGCGCCGTGC CGCGCGGCGA ACCCCAGGAG
GTGGCAGAAA TCGGCGTGCC GAGGAGGCAG ATGACGTTGC GCGCCTTGAT GCACTTCCTG
TTCGAGCGCG CGGGCTTCAA CCGCTGGAGC CCGGCGATGG CGGGCAAACG CAACCAGGGC
GTGCTGCGCA AGTACCTGCT GGAGGCGGCC GAGGAGATCA TGGTCAAGGG CATCCCGCTG
GCCGAGCGCC TGTATGTGCC CGAACCGTTC AGCGAGAGCG CCAAGGCCGA GGCGGCACAG
CGCCGGCGCG AGAAGCTGGC CGTGCTACGT CCCAAGGACG GGCAGACGCC GCTGGCCGTC
GTGATCGGCG AATTCAAGAC GAGCGAGGCC ACAAGCCAGG GCCGCCGGGT CTGGATCCGG
CACATGCCGG ACGCGCCGCT GCTGATCGCC AGCCGGAGTT GGGAGCGGAT CGAGCGGGTG
TTCGCGCCAC TGTTTGAGGC GCGCGATGCC GATACCGGCC ACCCGGTCCG GGTCATCCTG
GCCGCGTTGA TCCGCGCCCG CCGTGAATAC ACCTACGAGA TCGATGCGGC GAGTTTGATG
TTGACCAGCG AGCACTGGAT TCCCATTGAG GGCGTGCATG AATTGCCCCT GATCGACGCC
CTAGTCGCCC AGCACCGCCG CTTCGTCAAA CCGCTGCGCT ATGACGCTCG GAGCGCGGCG
GCCTTTCCGA ATGTCCTGCT GCTGGATGCC GGGACGGCGC CGGTGCCGCT ACACGTGGTG
AGCGCTTTCA TGGATACGAA GGAGCGGCTT TCCAAAGAAA GGGCAATTGC GGAAATCGGT
GCGCAGGGCG CTTGGGTCTG GAGGACAGAG GAACCGATGC CGCCCTTCAG ACGAGATGTA
GAGCTGGCGC TGCCGTCTTG A
 
Protein sequence
MDSQRFAIKG RVIGADDSQL QDALAHVYET PERPRCLCVP GGVEMYVARH RQFVVKRMPD 
SGSTHHPSCP SYEPEVQQSG LGELVGEAVL ESEPGRFELR VDFPWMRVIG RAVPRGEPQE
VAEIGVPRRQ MTLRALMHFL FERAGFNRWS PAMAGKRNQG VLRKYLLEAA EEIMVKGIPL
AERLYVPEPF SESAKAEAAQ RRREKLAVLR PKDGQTPLAV VIGEFKTSEA TSQGRRVWIR
HMPDAPLLIA SRSWERIERV FAPLFEARDA DTGHPVRVIL AALIRARREY TYEIDAASLM
LTSEHWIPIE GVHELPLIDA LVAQHRRFVK PLRYDARSAA AFPNVLLLDA GTAPVPLHVV
SAFMDTKERL SKERAIAEIG AQGAWVWRTE EPMPPFRRDV ELALPS