Gene Daro_1443 details


Gene Information

Locus tag: Daro_1443
Symbol:
ID: 3569050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1569433
End bp: 1570695
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 59%
IMG OID: 637679911
Product: twin-arginine translocation pathway signal
Protein accession: YP_284662
Protein GI: 71907075
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR03407] urea ABC transporter, urea binding protein
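
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1569433
end_bp = 1570695
gene_length_bp = 1263
protein_length_aa = 420

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be the stop codon.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.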


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATC GTCGCAACTT CATCAAGGCA ACCGTTCTCG CCGCCGCCCT GTCGTCCATC 
GGCATGGCCG CCCACGCCGC CGATACCATC AAGGTCGGCG TACTGCACTC CCTGTCCGGC
ACCATGGCGA TTTCCGAGAC CGCGCTCAAG GAAACCGTGC TGATGACCAT CGAGGAAATC
AACAAGTCCG GCGGCGTACT CGGCAAGAAG CTCGAGCCGG TCGTCGTCGA TCCGGCTTCC
AACTGGCCGC TGTTCGCCGA AAAGGCCCGC CAGCTGCTGA CCAAGGACAA GGTTGCCGTC
ACCTTTGGTT GCTGGACTTC GGTGTCCCGC AAGTCTGTGC TGCCGGTCTA CAAGGAACTG
AACGGCCTGC TCTTCTACCC GGTTCAGTAC GAGGGCGAAG AACTGGAAAA AAACGTCTTC
TACACCGGCG CTGCGCCCAA CCAGCAGGCC ATTCCGGCCG TCGAATACTT GATGAGTAAG
GATGGCGGCG AAGCCAAGCG ATTCGTGCTG CTCGGCACCG ACTATGTCTA TCCGCGTACC
ACCAACAAGA TTCTGCGCGC CTTCCTGAAG TCCAAAGGCG TGGCCGACGC CGACATCATG
GAAGACTACA CGCCGTTCGG CCACAGCGAT TACCAGACCA TCATCGCCAA GATCAAGAAG
TTCGCTTCCG AAGGCAAGAA GACCGCCGTC GTCTCGACCA TCAACGGCGA TTCCAACGTG
CCGTTCTACA AGGAACTGGG CAATGCCGGC CTGAAGGCCA CCGATGTGCC GGTTGTCGCC
TTCTCGGTCG GTGAAGAAGA GCTGCGCGGT GTCGATACCA AGCCGCTGGT TGGCCACCTG
GCTTCGTGGA ACTACTTCAT GTCGCTGAAG AATCCGGAAA ACGACAAGTT CGTGAAGATG
TACCGCGAAT GGGCCAAGAA GGCCAAGCTG CCAAATGCCG ACAAGGTCGT GACGAATGAC
CCGATGGAAG CTACCTACAT CGGCATCCAC ATGTGGAAGC AGGCTGTCGA GAAGGCCAAG
TCCACCGACA CCGACAAGGT TATCGCCGCC ATGGCCGGCC AGACCTTCAA GGCACCGAGC
GGCATCCTGT CGAAGATGGA CGAGAAGAAC CACCACCTGC ACAAGTCGGT GTTCATCGGC
GAAGTGAAGG CCGATGGCCA GTTCAATGTC GTCTGGAAGA CGCCCGGCCC GGTCAAGGCC
CAGCCGTGGA GCCCGTACAT CGCCGGCAAT GACAAGAAGA AGGACGAGCC GGAAGCGAAG
TGA
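
A GC content figure such as the one listed for this gene is usually the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as plain text with the spaces and line breaks used above. The function name `gc_content` is hypothetical, and the record does not state how its own figure was computed or rounded.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that group the sequence into blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example: the first ten-base block of the gene sequence.
print(gc_content("ATGAGCAATC"))
```

Passing the full gene sequence as one string gives the value for the whole gene.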
 
Protein sequence
MSNRRNFIKA TVLAAALSSI GMAAHAADTI KVGVLHSLSG TMAISETALK ETVLMTIEEI 
NKSGGVLGKK LEPVVVDPAS NWPLFAEKAR QLLTKDKVAV TFGCWTSVSR KSVLPVYKEL
NGLLFYPVQY EGEELEKNVF YTGAAPNQQA IPAVEYLMSK DGGEAKRFVL LGTDYVYPRT
TNKILRAFLK SKGVADADIM EDYTPFGHSD YQTIIAKIKK FASEGKKTAV VSTINGDSNV
PFYKELGNAG LKATDVPVVA FSVGEEELRG VDTKPLVGHL ASWNYFMSLK NPENDKFVKM
YREWAKKAKL PNADKVVTND PMEATYIGIH MWKQAVEKAK STDTDKVIAA MAGQTFKAPS
GILSKMDEKN HHLHKSVFIG EVKADGQFNV VWKTPGPVKA QPWSPYIAGN DKKKDEPEAK