Gene Daro_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4043 
Symbol 
ID3567040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4342662 
End bp4343828 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content60% 
IMG OID637682515 
Productextracellular ligand-binding receptor 
Protein accessionYP_287239 
Protein GI71909652 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones71 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTTC GCCTGACTGC CCTTGCAGCC GCCCTGATGC TTGCCGGTTC CGCCCATGCC 
GCCGACCAGA TCAAGGTTGG GCTGGTTTCG ACACTGTCTG GCCCTGGCGC CGGCCTCGGC
GTCGATATTC GCGACGGCTT CAATCTGGCC ATGAAGCATC TGAACGGCAA GCTGGGCAAT
CTGCCGGCCG AGGTGCTGAT CGCCGACGAT CAGCAGAATC CGGATATCGC CAAACAAACG
GCCGACAAGT TCCTGAAGAA GGACAAAGTC GATTTCATGA CTGGCATTGT CTTCTCCAAC
ATCATGCTCG CCGTCGGCCC GACCGTCTTC GAGAACAAGA CCTTCTACAT TTCGGCCAAC
GCCGGCCCGT CGCAGTATGC CGGCGAGCAG TGCAATCCCT TCTTCTTCAA CGTCGCCTGG
CAGAACGACA ACCTGCACGA AGCCGTCGGC AAGGTGGTGC AGGACAAGGG CTACAAGAAC
GTCGTGATCG TCACCCCGAA CTACCCGGGC GGCAAGGATG CAGTGTCTGG TTTCAAGCGC
TACTACAAGG GCAAGGTGGC CGACGAGATC TACACCAAGC TCGGCCAGCT CGACTATGCC
GCCGAACTGG CGCAGATTCG CGCCACCAAG CCGGATGCAC TGTTCTTCTT CCTGCCGGGC
GGCATGGGCA TCAACTTCGT CAAACAGTTC GTTTCAGCCG GCCTGTCGCG CGACACGCAG
CTGTTTGCCC CCGGCTTCTC GGCCGATGAG GACGTGATCA AGGCCGTCGG CAGCTCGATG
ATGGGCATGT TCAACTCGTC GCACTGGGCG CACGACATGG ACAATGCCGA GAACAAGCGC
TTCGTTGCCG ACTTCCAGAA GGAATATGGC CGCCTGCCCT CGCTCTACGC TTCGCAGGGC
TACGATGCGG CGCTGATGAT GGATGCCGCC GTGCGTGATG TGAAGGGCAA GGTCGAGGAC
AAGGCCGCGT TGCAGAAGGC ACTGGAAGCC AAGCGCTTCA AGTCCGTGCG CGGCGACTTC
AAGTTCAACA CCAACCACTA CCCGGTGCAG AACTACTACC TGCGCGCCAT CGGTAAGGAT
GCCCAAGGCC GGGTAACGAA CAAGACCATG GGCACCATCT TCACCAACCA TGCGGATGCC
TACGTTGCTT CCTGCAAGAT GAAGTGA
 
Protein sequence
MSLRLTALAA ALMLAGSAHA ADQIKVGLVS TLSGPGAGLG VDIRDGFNLA MKHLNGKLGN 
LPAEVLIADD QQNPDIAKQT ADKFLKKDKV DFMTGIVFSN IMLAVGPTVF ENKTFYISAN
AGPSQYAGEQ CNPFFFNVAW QNDNLHEAVG KVVQDKGYKN VVIVTPNYPG GKDAVSGFKR
YYKGKVADEI YTKLGQLDYA AELAQIRATK PDALFFFLPG GMGINFVKQF VSAGLSRDTQ
LFAPGFSADE DVIKAVGSSM MGMFNSSHWA HDMDNAENKR FVADFQKEYG RLPSLYASQG
YDAALMMDAA VRDVKGKVED KAALQKALEA KRFKSVRGDF KFNTNHYPVQ NYYLRAIGKD
AQGRVTNKTM GTIFTNHADA YVASCKMK