Gene Daro_4097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4097 
Symbol 
ID3566713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4392752 
End bp4393891 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content61% 
IMG OID637682569 
Productextracellular ligand-binding receptor 
Protein accessionYP_287293 
Protein GI71909706 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGA AGCTGACCCT CGCCGTACTC GCCACCATCT CCACCGCGGC GCTGGCCGAC 
ATCAATGTTG GCGTTTCCGT CGCCGCCACC GGTCCGGCCG CCTCGCTCGG CATTCCGGAA
AAGAACACCT TCGCCCTGTT GCCGACCACC ATCGGCGGCC AGAAGGTCAA TTACATCATC
CTCGATGACG CGACCGATCC GACCGCGGCG ACCAAGAACA TCAAGAAGCT GATCAGCGAA
AACAAGGTCG ACGTCGTCGT CGGCTCGTCG ACCACGCCGA GTTCGCTGGC CATGATGGAT
GTCGCCGTCG AGAACGAAAC CCCGCTGATT TCCATGGCCG GTTCCGCCAT CGTCGTCGAA
CCGATGGACG ACAAGCGCAA ATGGGTATTC AAGACGGCCC AGAACGATGC CCACATGGCC
ACCGCGCTGG TCCAGCACAT GACCGACAAG AACGTGCAGA CCGTCGCCTT CATCGGCTTT
GCCGACGCCT ACGGCGAAGG CTGGTACAAG GAATTCGCCA AGATCGCCGA GGTCCGCAAG
CTGAAGATCG TCGCCAGCGA GCGTTACCAG CGCAACGATA CCTCGGTGAC CGGCCAGATT
CTCAAGATCA TGTCCGCCAA GCCGGATGCT GTTCTGGTCG GCGGTGCCGG TACCCCGGCC
GCCCTGCCAC AGAAGGTGCT TAAGGAAAAA GGCTACAAGG GCCTGATCTA CCAGACGCAC
GGCGTGGCCA ACAATGATTT CCTGCGCGTC GGCGGCAAGG ATGTCGAAGG CGCTTTCCTG
CCGGTCGGCC CGATGGTCGT TGCAGCGCAG CTGTCGAATG ACAACCCGGT CAAGAAATCG
GCACTGGAGT ATGTGAGCAA GTACGAAGCT GCCCACGGCA AGGGCAGCGT CAGCTCCTTC
GGCGGCCACG CCTGGGATGC CGGCGTACTG CTCGGCAGCG CCATTCCGGT TGCCCTGAAG
AAAGCCAAGC CGGGTACGGT CGAATTCCGT CGTGCCCTGC GCGACGCGTT GGAGAACACC
AAGAACGTCG CCGGCGCCCA CGGCATCTTC AACCTGACGC CGAACGACCA CCAGGGTTTC
GACCAGCGCG CCCGCGTCAT GGTGACCATC GAAAACAACA CCTGGAAACT GCTGAAATAA
 
Protein sequence
MIKKLTLAVL ATISTAALAD INVGVSVAAT GPAASLGIPE KNTFALLPTT IGGQKVNYII 
LDDATDPTAA TKNIKKLISE NKVDVVVGSS TTPSSLAMMD VAVENETPLI SMAGSAIVVE
PMDDKRKWVF KTAQNDAHMA TALVQHMTDK NVQTVAFIGF ADAYGEGWYK EFAKIAEVRK
LKIVASERYQ RNDTSVTGQI LKIMSAKPDA VLVGGAGTPA ALPQKVLKEK GYKGLIYQTH
GVANNDFLRV GGKDVEGAFL PVGPMVVAAQ LSNDNPVKKS ALEYVSKYEA AHGKGSVSSF
GGHAWDAGVL LGSAIPVALK KAKPGTVEFR RALRDALENT KNVAGAHGIF NLTPNDHQGF
DQRARVMVTI ENNTWKLLK