Gene Daro_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1039 
Symbol 
ID3568201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1139088 
End bp1140494 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content58% 
IMG OID637679500 
Producthypothetical protein 
Protein accessionYP_284265 
Protein GI71906678 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGGCA AGTCCCAAAA GGAAACGGAT ATGTCGAAGC ACCAAGAGAA TGGCCGCGAT 
GGCCTCACCG AGCAGCAACC CCAAAACCTG ACCGCCACTT CCCTGGCCGC CGCCCGCACC
GCGCGCGAGG CGCGGGCGGA CGGTGGACAA GAAGTGGCGG AACCTGCCTT TGAGGGTGCG
CTCCCTAGTA ACACAGCACC CTATAACAGC AATACAGGAC AGGAATGGTT TAAAGCCCTT
CGTTGGGGCG TTGATAGCTT GTACCTTTCC TATCCCGGCG AACTCTCCCG AGAGTCAGAC
CTTCGCCTCA AGGAACTGAA GCAATTCGCA CAATCCAATG ATCCCGGCGA GGTAGCAAAG
GCCCAGTTGC CACTGGCTGG TCATATCTTC GAGGTGAAGG AAAAAGGCGC GTCGCTATTC
CCCTACATCC TGGAAGATGG CGCTTTCCGT ATTCAGCTTT CCCGGCCAGG CCACAAAGCC
CCGATGGCCT ATGTGAAGGT ATCGGCCAAG TTCCTGGCCC ATGTCGGCCC GGTCGGAGCC
GAACGCCAGC TGTATGCCTT GCTCTCCGAG TTGGGCGAGC TCAAGGAATC GGCCAACGTC
AGCCGAATTG ACCTGTTTGT CGATTTCCAA AGCGGTTTCG ATATGGAAGG CTGGGATCGT
CATGCCTGGG TGACGCGGGC CTCCTCGATC AACAGCTATG CCGTGTCCGG GCAGTTCTCC
GGCTGGTCAG TGGGTCTTGG TGGGAACATC TCGGCCAGGC TCTATAACAA GCTCCTGGAG
ATCGTCGTCA GCGGCAAGGA ATGGATCATT CCCCTATGGC AGAAATCCGG TTGGGATGCC
TCGGCTCTGG TGTGGCGTCT GGAGTTTGAG ATCAAGCGGG AAGTCCTGAC TCAGAAGGGC
CTTTCCAAGC TCGCTGAGGT GATGAGCAAC TTGAACGGGT TATGGGACTA CGCAACAACG
GAATGGCTGC GCCTGACGCT GCCCAATGCG GAGGACAAGA CCCGTTCCCG GTGGCCGATT
CATCCTCTGT GGCTGTATCT ATCTGCCGTC GATTGGGAGA GCAAAGGCGG CCCCCTGGCT
AAACGTTTCA GTCCGAGCCG CAGCCCCAAT GACGACAAGC TATTCCAGAT CGGCTACAGC
GCGATTCTGT CGTACATGGC CAAGCATGGT TTCCCAGCTT CGGAGTTGTA CGAAGGCTGC
GAGGATTTCC TGGCCAGTGC CTATGCCTAT CACGAGCAGA AGGCGCTTGA CCTGGGCCTG
CCCTTCGAGG ACTTCATTGC TGAGAAGCTG GCCCTGAAGC ATCGCCAGTA CAACACGGCG
GTCAATGATC CCGACCAGGA AGCCAAGCGC AAGGCCAAGG CCCTGGAGGA TGAAGCCAGG
GCTTACCGGA AAGCCTCGGG GGGCTGA
 
Protein sequence
MPGKSQKETD MSKHQENGRD GLTEQQPQNL TATSLAAART AREARADGGQ EVAEPAFEGA 
LPSNTAPYNS NTGQEWFKAL RWGVDSLYLS YPGELSRESD LRLKELKQFA QSNDPGEVAK
AQLPLAGHIF EVKEKGASLF PYILEDGAFR IQLSRPGHKA PMAYVKVSAK FLAHVGPVGA
ERQLYALLSE LGELKESANV SRIDLFVDFQ SGFDMEGWDR HAWVTRASSI NSYAVSGQFS
GWSVGLGGNI SARLYNKLLE IVVSGKEWII PLWQKSGWDA SALVWRLEFE IKREVLTQKG
LSKLAEVMSN LNGLWDYATT EWLRLTLPNA EDKTRSRWPI HPLWLYLSAV DWESKGGPLA
KRFSPSRSPN DDKLFQIGYS AILSYMAKHG FPASELYEGC EDFLASAYAY HEQKALDLGL
PFEDFIAEKL ALKHRQYNTA VNDPDQEAKR KAKALEDEAR AYRKASGG