Gene Daro_3540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3540 
Symbol 
ID3567618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3790830 
End bp3791963 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content56% 
IMG OID637682013 
Productextracellular ligand-binding receptor 
Protein accessionYP_286739 
Protein GI71909152 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value0.229705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.129038 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTGGA AGCTGGTGGT CAGGGGGGTA GTCGGGGTGC TGGGCGTTGT TTCCTCAAGT 
TTCGTCATGG CTGAGACAGG CGTAAGCGAT AGCACGATTT TGGTCGGCCA GTCTGTTCCG
CTCAGCGGCC CGTCGCAGGA ACTGGGCAGC GAGATGAAAC TGGGGGTCCA GCTTTATTTT
GACCAGGTGA ATAGCCAGGG GGGCATCAAC GGTCGAAAGC TGGAACTGAA GGTATTGGAT
GATGGCTATG AGCCGGAACG CACCGCCGCC AATACGCGTC AGCTGATCGG AAAAGAGGGC
GTATTTTCCT TGCTTGGTTT TGTCGGAACA CCGACCAGTG TCGCAGCACT GCAGGTTTCG
AATCCAGCCA AAGTGCCTTT TGTTGGTGCT TTCACCGGCG CTGATGCCTT GCGCGTACCG
TTCAATCGGT ACGTATTCAA TATCCGGGCA AGCTACGCCG AGGAGTGCGA GCGGATTGTC
GAGCAGTTCA CTTCGCTGAA CGTCAAGCGC ATTGCCGTGT TTTTCCAGAA CGATGGCTTC
GGCAAGGCTG TTCTGAGTGG CGTCGAGCGA GCGATGGAAA AGCGAGGCTT GCAGGTAATC
AATAGCACCG CTATCGAACG GAATTCGCTG AACGTTGCTC CCGCGGCAAA AGCGATTGCC
GCAGTTCGCC CGGACATGGT GATCATGGCT GTCCCCTACA AGCCCAGCGC AGCCTTTATC
AACGCGATTC GCGGCGAAGG CGCCGCACCG CAGTTCTATA CGGTTTCGTT TGTCGGGCCG
CAAGCCCTGG CCAAGGAACT TGGGCCCAAT GGCGCGGGGG TGGGCATTTC CCAAGTGATG
CCTTTCCCTT GGTCGATCAA CGCGCCGATT GTTCGTGAGT ATCAGAAGTT GCTGCAGGGC
AAGGCTGAAC CCTCATACGT CAGCCTGGAA GGCTTCATCG CGGCGAAAGT TTTTGTCGAA
GGGGTGAAAC GCGCCGGCAA GGATCTGACC CGTGAAAAGC TCATCGGTGC CATGGAAGGG
ATGCACAGCT ACGATACCGG TGGTTATGCC GTCAGCTTTG GTCCAAACGA TCATAACGGC
TCGAAATTCG TCGAACTGAC GGTTCTGCGC CGTGATGGGA AAATCATGCG CTGA
 
Protein sequence
MNWKLVVRGV VGVLGVVSSS FVMAETGVSD STILVGQSVP LSGPSQELGS EMKLGVQLYF 
DQVNSQGGIN GRKLELKVLD DGYEPERTAA NTRQLIGKEG VFSLLGFVGT PTSVAALQVS
NPAKVPFVGA FTGADALRVP FNRYVFNIRA SYAEECERIV EQFTSLNVKR IAVFFQNDGF
GKAVLSGVER AMEKRGLQVI NSTAIERNSL NVAPAAKAIA AVRPDMVIMA VPYKPSAAFI
NAIRGEGAAP QFYTVSFVGP QALAKELGPN GAGVGISQVM PFPWSINAPI VREYQKLLQG
KAEPSYVSLE GFIAAKVFVE GVKRAGKDLT REKLIGAMEG MHSYDTGGYA VSFGPNDHNG
SKFVELTVLR RDGKIMR