Gene Daro_1949 details


Gene Information

Locus tag: Daro_1949
Symbol:
ID: 3567878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2101581
End bp: 2102738
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 60%
IMG OID: 637680420
Product: aminotransferase, class V
Protein accession: YP_285165
Protein GI: 71907578
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID:
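
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon counted in the gene length but not in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2101581
end_bp = 2102738

# An inclusive interval contains (end - start + 1) positions.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```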


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.00000781244
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.177382
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCGCC CCGTCTATCT CGACTACAAC GCCACGACGC CGCTCGATCC TGCGGTGCTG 
GCAGCAATGT TGCCCTGGCT GGAAAGCCAG TACGGCAACG CCTCGAGTCG TCACGAATAT
GGCCGGCAGG CACGGCAGGC GATTGATGAG GCGCGACAGA GAGTTGCAGC GGCGGTCAAT
GCGCACCCGA CGGAAGTGAT TTTTACCAGC GGTGGCAGCG AAGCCAATAA CCTCTTTCTG
AAGGGCGCTG CGGCCAGTCT CAAACCGGGC ACGTTGGCCG TAAGTGCCAT CGAGCATCCC
TGTGTGCTCA AACCGGCCGC CCAGTTGGTA AAGCAGGGTT GGCAGGTCAA ACATATCGCA
GTCGATAGCG CCGGAAGGGT GAGTGCGGCG GATTACGCCG AAGCCATGCA GGCCAAACCA
AAGCTGGTGT CGGTGATGCT TGCCAATAAC GAAACCGGTG TCGTGCAGGA TGTCGCTGTG
CTGGCAAACT CGGCAAAGAG CGCTGGCGGC TGGTTTCATA CCGATGCCGT CCAGGCCTTG
GGGAAGCTGG ATATCGACTT TCGCGCCCTC AACATGGCCG GCGTGCATGC CATGACGCTA
TCTGCCCACA AGGCCTACGG CCCGAAAGGT GCAGCAGCGC TGGTTCTCGA CAAGCGTGTC
GAATTGCAGC CGCTGATTGC CGGTGGTGGC CATGAGCGAG GCTTGCGTTC CGGCACTGAA
AACGTGCCGT CGATTGTCGG ATTTGGCGTT GCTGCGGAAC TTGCAGCGAA TCGTGTTGCC
GAACTGTCGG CTCGCTTGCG AGTCATGCAG GCGAAGCTGG AAGCCGGGCT GGTTGCATTG
GGTGCCCGGG TCTTTGCGAC AGATGCGATG CGTTTGCCGA ACACCAGCTA TTTCGCCTTT
CCGGATATCG ATGGCGAAAC GCTGGTCGGC AAGCTGGACC GCGAAGGGTT TGCTGTGGCT
AGCGGCGCGG CATGTTCCAG CGCCAATCCG GAGCCATCGC ATGTTCTGCG GGCAATGGGT
GTGGCGCCGG AAATCGCCCG TGGGGCAATA CGTGTCAGCC TCGGGGCAAG TAACACTGAA
GTTGAAATTG AACAATTCAT CAACGCCTTG CAGGCTACAG TCGGACGCCT GCAGGGACTG
ACGGCGATGG CTGTCTGA
 
Protein sequence
MFRPVYLDYN ATTPLDPAVL AAMLPWLESQ YGNASSRHEY GRQARQAIDE ARQRVAAAVN 
AHPTEVIFTS GGSEANNLFL KGAAASLKPG TLAVSAIEHP CVLKPAAQLV KQGWQVKHIA
VDSAGRVSAA DYAEAMQAKP KLVSVMLANN ETGVVQDVAV LANSAKSAGG WFHTDAVQAL
GKLDIDFRAL NMAGVHAMTL SAHKAYGPKG AAALVLDKRV ELQPLIAGGG HERGLRSGTE
NVPSIVGFGV AAELAANRVA ELSARLRVMQ AKLEAGLVAL GARVFATDAM RLPNTSYFAF
PDIDGETLVG KLDREGFAVA SGAACSSANP EPSHVLRAMG VAPEIARGAI RVSLGASNTE
VEIEQFINAL QATVGRLQGL TAMAV
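
The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of how a reader might clean a block, estimate GC content, and check that the gene sequence translates to the listed protein. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical and not part of the record. The codon table is the standard genetic code, which translation table 11 shares for internal codons; alternative start codons are not handled here, which is harmless for this gene because it begins with ATG.

```python
import re

# Standard genetic code in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return re.sub(r"\s+", "", text).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence, copied from the record.
first_block = clean_sequence("ATGTTTCGCC CCGTCTATCT CGACTACAAC")
last_block = clean_sequence("ACGGCGATGG CTGTCTGA")

print(translate(first_block))  # compare with the start of the protein sequence
print(translate(last_block))   # compare with the end of the protein sequence
print(gc_percent(first_block))
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `translate` gives values that can be compared with the GC content and protein sequence listed in the record.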