Gene Daro_2712 details

Gene Information

Locus tag: Daro_2712
Symbol:
ID: 3566920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2906111
End bp: 2907328
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 62%
IMG OID: 637681179
Product: extracellular ligand-binding receptor
Protein accession: YP_285912
Protein GI: 71908325
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
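
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be less than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(2906111, 2907328)
print(length, protein_length_aa(length))  # prints: 1218 405
```

Under these assumptions the computed values match the Gene Length (1218 bp) and Protein Length (405 aa) fields.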


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value: 0.620726
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00485368
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAA TTTCAGCCGT CATCAGCGGC ATCGCACTCT GCATCTGCAC CCAAACTGCA 
CCGGCAGCAG AGCCCATCCG GATCGCCGTG ACTGGCCCCT ATTCGGGTCC CTCATCACCG
ATGGGCCAGT CCATGTTGGC TGGTGTCCGC CTGGCGATCA GCGAAATGAA TCTGGGTGGC
GGGCTGCTCG GGCGCCAACT GGTCCTGGTC GAAAAGGATG ACAAGGGGGA TCCGGCAACC
GGCAAGGAGG TGGTCGAGGC AGCCATTCGC CAGGACAAGG TGGTGGCCGG ACTCGGCGTG
GTCAACACCG GCGTCGCACT GGCCTCGCTG AAGGACTATC AGGACGCCCG TGTGCCAGTC
ATCGTCAATG TCGCCACCGG CAGCTCGGTG GCCCGGCAGT TTTTCCCCCC CGCCATCCCG
GACAGCTACG TTTTCCGCAA CTCGGCCAGC GACGACATCC AGGCTGCAAT GATCGTGCGC
GAAGCGGTCA AGCGTGGCCG CTACACCAAG CTCGCCATCC TCCACGACAG CACGCCCTAT
GGCGAACAGG GGCGCGACCA ATTGACCCGG GAACTTACCG CTCTCGAGCT CAAACCCGTG
GCAGTCGAAA GTTTCGCCCC CGGCACCAGG GATCTTGCCG CCAACCTGCA GCGAGCCCGC
GAGGCCGGTG CCGAGGCGAT CCTGACCTAT GCCATCGGCC CGGAACTGGC CGTCATTGCC
AACAGTCGGG CCAAGATGGG CTGGAAGGTG CCGATGATCG GCAGCTGGCC GTTGTCTCTG
CCCAACTTCA TCGACGCAGC CGGCAAGAAT GCGGAAGGCG CGCGCATGCC GCAAAGTTTC
GTCGAGGCAG CGAACAACTA TCGCCGTACT TCCTTCATCA GCGCCTACCA TCATGCCGAT
GGCAGCAAAC GCATTCCCTC GGCGGTCTCT GCCGCCCAGG GTTACGACTC GGCTCTGCTA
CTGATAGCGG CCATCAGCCA GGCCGGCAGC ACCGAAGGCC CGAAAATTCG CGCCGCGCTG
GAAAACCTGC AAAAGCCTGT CTACGGCGTG ATCACGACTT ACTCACCGCC GTTTTCGAAA
GACGACCACG AGGCGATCAG CGAAAACATG GTGCTGATGG GTGAGGTAAA ACAGGGGCAG
GTTGCTTTCG CCCACCCGGA TGACGAAAAG CGCAGCCTCC TCGTCCACCG CAAGCCGAAA
GTTCAGCCGG TCAATTGA
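
The gene sequence is printed in groups of 10 bases, so the spacing has to be stripped before any computation such as a GC content check. The following is a minimal sketch of that step, assuming a plain G+C fraction over the full sequence; the record does not say how its 62% figure was derived. `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a 10-base-grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Final line of the gene sequence above, used here only as a short demo input.
tail = clean_sequence("GTTCAGCCGG TCAATTGA")
print(len(tail), gc_fraction(tail))  # prints: 18 0.5
```

Passing the whole gene sequence block to `clean_sequence` and then to `gc_fraction` would give a value to compare with the GC content field.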
 
Protein sequence
MKKISAVISG IALCICTQTA PAAEPIRIAV TGPYSGPSSP MGQSMLAGVR LAISEMNLGG 
GLLGRQLVLV EKDDKGDPAT GKEVVEAAIR QDKVVAGLGV VNTGVALASL KDYQDARVPV
IVNVATGSSV ARQFFPPAIP DSYVFRNSAS DDIQAAMIVR EAVKRGRYTK LAILHDSTPY
GEQGRDQLTR ELTALELKPV AVESFAPGTR DLAANLQRAR EAGAEAILTY AIGPELAVIA
NSRAKMGWKV PMIGSWPLSL PNFIDAAGKN AEGARMPQSF VEAANNYRRT SFISAYHHAD
GSKRIPSAVS AAQGYDSALL LIAAISQAGS TEGPKIRAAL ENLQKPVYGV ITTYSPPFSK
DDHEAISENM VLMGEVKQGQ VAFAHPDDEK RSLLVHRKPK VQPVN