Gene Daro_1440 details


Gene Information

Locus tag: Daro_1440
Symbol:
ID: 3569047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1565107
End bp: 1566303
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 56%
IMG OID: 637679908
Product: extracellular ligand-binding receptor
Protein accession: YP_284659
Protein GI: 71907072
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
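
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name check_lengths is hypothetical and not part of the source database.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # total codons, stop included
    return {
        "span_matches": span == gene_length_bp,
        "multiple_of_three": gene_length_bp % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for Daro_1440
result = check_lengths(1565107, 1566303, 1197, 398)
print(result)
```

For this record all three checks hold: 1566303 - 1565107 + 1 = 1197 bp, and 1197 / 3 = 399 codons, which is 398 residues plus one stop codon.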


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value: 0.615682
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGGGG TATGCGCTGC GCCAACGGCA TCGCCAATTA TCATTGCCCA TACTGCGCCA 
ACAACCGGGC GATTTTCCTT GCATGCCGAA TCCGATCGGC GTGGTGCCGA GATGGCAATC
GACGAATTCA ATGCGCGGGG CGGTGTCCTG GGCCGGGAAA TCGTACTTGT TTCACGCGAC
CCCGGTCTCG ATCCAAAACG TGCAGCCTCG GTTGCCGAGG AGCTCATTAC ACAGACCAAG
GTTGGATTCA TCGTGGGAGC GCTTGCTTCT GGCGTTGCTG CGAGCATGTC TGCGGTCTGC
CAGAAATACG GTGTGATCTA CATCAACACC AATTCATCGG CGCCCTCTGA ATCGGTCGAG
AACGCCCATC GCACAAAGTT CGCCTTTGAC GCTCATGGCG CGAACTTCAA CCGGGCGCTC
CTGAAGTATG CGCTGGCCAA TCGCAAATCA AAGCGGGTCC TGCTGTTGAC TGAGGATAAT
GAATTCGGTC GCAGCAATGC GCTAGCCATG CGCCCCTATA TCGCTGAGTA TGGGGGGACG
GTAGTTGGTG AAGTTGTCAC GCTGGAGACG CTACCTGATC CTGTCGAGAT CCTGAGAAAA
GTGGCCGCTA CGCCGGCTGA TGTCGTCGCC GTCAGTATCA GTGGTGACAA CCAGATCAAG
CTGTTTTCAC AGATTGATCC GAAGGTACTC GAAAAGCAAT TCTGGATCGT TGGCGAGGTG
GACTGGGAAG AGCTTTATCC CGCCCCAGGA ACGCCACGAC CGCTATTCGG TACCACTTGG
GCCTGGAATC TGAAAACCCC GGGAACCGCA GACTTCGTCG CGCGTTACCG GAAACGCTAT
GGCCATACCA AGCTCAACTA TCCGGGAGAT GTCACTCATG CAGCCTATCT CGCTACAAAG
GCGTTGCTCG TCGCTATCGA GCAGGCCGGC AGTACGGATA GCCATGCGGT CATCCAGCAA
CTCGAAAGCT ACAAGGCAAC TGCCAAGGAG CGTATGCAGG ATGCCGCAGC ATTCATGGAC
CCGAACAGTC ACCATGTCCA GCAAAGCATT TACATCGCTC GCTGGAATCC CATTGCTGCT
CGGCCTGAAC AAGGTATCGA GATAGTTGGG CATATCCCTC CTGAGCAAGT CCGCTATGAA
CCCGAACGGA CAACCCATCT GGAATTGCTG GCAGATACAC CACATTACGC AAAATAA
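
The GC content field of the record could be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G and T bases, with the spaces between 10-base blocks ignored; the function name gc_content is hypothetical, and how the source database rounds the value is not stated.

```python
def gc_content(seq):
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the sequence
    can be pasted with its 10-base block spacing intact.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, for illustration only
first_block = "ATGCCGGGGG"
print(gc_content(first_block))
```

Passing the full gene sequence instead of first_block would give the value to compare with the reported 56%.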
 
Protein sequence
MPGVCAAPTA SPIIIAHTAP TTGRFSLHAE SDRRGAEMAI DEFNARGGVL GREIVLVSRD 
PGLDPKRAAS VAEELITQTK VGFIVGALAS GVAASMSAVC QKYGVIYINT NSSAPSESVE
NAHRTKFAFD AHGANFNRAL LKYALANRKS KRVLLLTEDN EFGRSNALAM RPYIAEYGGT
VVGEVVTLET LPDPVEILRK VAATPADVVA VSISGDNQIK LFSQIDPKVL EKQFWIVGEV
DWEELYPAPG TPRPLFGTTW AWNLKTPGTA DFVARYRKRY GHTKLNYPGD VTHAAYLATK
ALLVAIEQAG STDSHAVIQQ LESYKATAKE RMQDAAAFMD PNSHHVQQSI YIARWNPIAA
RPEQGIEIVG HIPPEQVRYE PERTTHLELL ADTPHYAK
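
The protein sequence can be checked against the gene sequence by translating the latter. The following is a minimal sketch, not the database's own method: it assumes the sequence is given 5' to 3' in frame from the first base, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so 10-base block spacing is accepted.
    """
    seq = "".join(dna.upper().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record
first_line = "ATGCCGGGGG TATGCGCTGC GCCAACGGCA TCGCCAATTA TCATTGCCCA TACTGCGCCA"
print(translate(first_line))
```

The first 60 bases give MPGVCAAPTASPIIIAHTAP, which agrees with the first 20 residues of the protein sequence listed above.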