Gene Daro_4098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4098 
Symbol 
ID3566714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4394031 
End bp4396883 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content65% 
IMG OID637682570 
Productinner-membrane translocator:ABC transporter related 
Protein accessionYP_287294 
Protein GI71909707 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0411] ABC-type branched-chain amino acid transport systems, ATPase component
[COG0559] Branched-chain amino acid ABC-type transport system, permease components
[COG4177] ABC-type branched-chain amino acid transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTCC AGATTCTCCT TCTTCTCGGC CAGGATGGCA TCACCAACGG TGCCATCTAT 
GCCTTGCTCG CCCTCGCGCT GGTGCTCGTT TTTGCCGTCA CCCGCGTCAT CTTCATTCCC
CAGGGCGAAT TCGTCGCCTT TGGCGCGCTG ACGCTGGCCT CACTGCAGGC CGGCAAGGTA
CCGGGCACGG TCTGGTTGCT GCTCGGCCTG GGTACCACCG TCGCCATCCT CGACGGTTAC
CGCCTGTTGC GTGAAGGCCG CTACGCCAAG CTGCCGCCGC TGCTCATCGT CAATGTCGGT
ATCCCGCTGC TCTTCGCCGC CGGGCTGTTC GCCATGCCGC CAACCCAGTT GCCACTGTGG
ACGCAGGTGG TCATCGCCAT GCTGCTGGTC GTACCGCTCG GCCCGATGAT CTACCGCATC
GCGTTCCAAC CCCTGGCCAG CGCGCCGGTA CTGGTGCTGC TGATCGTCGC AGTGGCCGTC
CATCTGGCGC TGATCGGCCT CGGCCTGCTC TTCTTCGGGG CCGAAGGCTG GCGCACCCCG
GCCTTTACCG ATCTCAGCTT CGAACTGGCC GGCGCCCCGC TGCAGGGGCA AACCGTCCTG
GTCGTCGTCG CCTCGGCACT ACTGATCATC GGGCTGTATT TCTTCTTCGA ACGCACCATC
TACGGCAAGG CGCTGCGTGC CACCGCGATG AACCGCACCG GCGCCCGCCT GATGGGTATT
CCGCCCGTGC TGGCCGGCAA GCTGTGCTTC ACGCTGGCCG CCGCCATCGG CGCCTTCTCC
GGCATCCTGA TCGCCCCGAT CACCACCATC TACTACGACT CCGGCTTCCT GATCGGCCTC
AAGGGTTTCG TCGGCGCCAT CATCGGCGGC CTGGCCTCCT ATCCGCTGGC GGCACTGGGC
GCCATCCTGG TCGGCCTGCT TGAATCTTAT TCGTCCTTCT ACGCCAGCGC CTTCAAGGAA
GTCATCGTTT TCACGCTGAT CATCCCGGTG CTGTTGTGGC GCTCGCTGAC CGCGCACCAC
ATTGAGGAAG AGGAAGAAAA GGGCGACGAG GCACCGGCTG GCAAGATCAC CACCGGCAAG
CGCAGCGCCA GCCTGATCCG TCACCTGCCG ATGCTGGCCT TCGTTGCCGT GCTCGGCATC
AGTCCGCTGC TACTGCCTGA ATTCACCATC ACGCTGCTCA ACTACATCGG TCTCTACGCC
GTCGTCGCCG TCGGTCTCGT CCTGCTGACC GGTGTCGGCG GTCTGACTTC GTTCGGGCAA
GCGGCTTTCG TTGGCCTCGG TGCCTACACG ACGGCCTGGC TGACCACGGT TTATGGCCTG
TCGCCGTGGC TGACCCTGTT CATCGGCATG GCCATCACGG CCGCCGTCGC CCTGTCGCTC
GGCTTCATCA CGCTGCGCAT GGGCGGCCAC TACCTGCCGC TGGGGACCAT CGCCTGGGGC
ATCAGCCTGT ATTTCCTGTT TGGCAACGTC GAATTTCTCG GCGGCCACAC CGGCATCACC
GGCATCCCGG CCGTCTCGCT GTTCGGCTGG GAGCTGAAAT CAGGCCGGGA ATTCTTCTAC
CTGATCTGGC TGGTCGTGCT GCTGGCCATC GTCAGCATCC GTAACCTGCT CGATTCCCGC
GTCGGCCGGG CCATCCGGGC ACTCAAGGGC GGCACGGTGA TGGCCGAGGC GATGGGCGTC
AATACGGCCC GCGAGAAGAT CATCATCTTC CTGATCGCCG CCCTGCTGGC CAGCATTTCC
GGCTGGCTCT ATGCCCACCT GCAGCGCTTC GTCAATCCGA CGCCGTTCGC GCTGAACCAG
GGTATCGAAT ACCTGTTCAT GGCGGTGGTC GGTGGCATCG GCCACGTCTG GGGCGCCGTG
CTCGGCGCCG GCGTGATCAC CATCCTCAAG CAGTGGCTGC AGGACTTGCT GCCGCAATTG
CTCGGCCAAA GCGGCAATTT CGAGGTCATC GTCTTCGGCA TCGCCATGGT GTTGATCCTG
CAGAAGGCGA GGGCAGGCTT GTGGCCCGTC ATCCTGCGTC TGCTGCCTTC GCGCAGCATC
ATCCGCGAAG TGCCGCAGGC CGAGGCCCTG CCCAAGCGCC AGCCGACCGC TGCCGGCAGC
TTGTTGCTGG AAGCCAGCGA GGTGACCAAG CGTTTCGGCG GACTGGTCGC CAACAACAAC
ATGAGCCTGA CCGTGCAGGC TGGCGAAGTG ATGGCGCTGA TCGGCCCGAA TGGCGCCGGC
AAGAGCACGA TGTTCAATTG CATCTCGGCA GTCAATCCGG CCACCGAGGG CAAGATCGCC
TTCCTCGGCG AATCGACGGC CGCATTGGCC GCCCGTGACA TCGCCCGGCG CGGCATGAGC
CGGACTTTCC AGCACGTGCG GCTGCTCGGC AACATGAGCG TGCTGGAAAA CGTCGCCATC
GGCGCCCATC TGCGCGGTAG CAAGGGTGTC CTCGCCGCCG CGCTGCGCCT CGACCGGGCC
GAAGAAAACC GTCTGCTCGC CGAAGCCGCC CGCCAGATCG AGCGTGTCGG CCTGGCCGAG
CACATGTTCG ACGCCGCCGG CAGCCTGGCC CTTGGCCAGC AGCGCATCGT CGAAATCGCC
CGCGCCCTGG CTTCCGATCC CTGCCTGCTG CTCCTTGACG AACCGGCCGC CGGTCTGCGC
TACAAGGAAA AGCAGGCGCT GGCCGAACTG CTGCGCAAGC TGCGCGCCGA AGGCATGGGC
ATCCTGCTCG TCGAACACGA CATGGACTTC GTCATGGGAC TGGCCGATCG GGTCGTCGTC
ATGGAATTCG GAGAAAAGAT CGCCGAAGGT CTGCCGGAGC AAGTCCAGCA GGATCCGAAG
GTACTTGAAG CTTATCTGGG AGGGGTCGAA TAA
 
Protein sequence
MDLQILLLLG QDGITNGAIY ALLALALVLV FAVTRVIFIP QGEFVAFGAL TLASLQAGKV 
PGTVWLLLGL GTTVAILDGY RLLREGRYAK LPPLLIVNVG IPLLFAAGLF AMPPTQLPLW
TQVVIAMLLV VPLGPMIYRI AFQPLASAPV LVLLIVAVAV HLALIGLGLL FFGAEGWRTP
AFTDLSFELA GAPLQGQTVL VVVASALLII GLYFFFERTI YGKALRATAM NRTGARLMGI
PPVLAGKLCF TLAAAIGAFS GILIAPITTI YYDSGFLIGL KGFVGAIIGG LASYPLAALG
AILVGLLESY SSFYASAFKE VIVFTLIIPV LLWRSLTAHH IEEEEEKGDE APAGKITTGK
RSASLIRHLP MLAFVAVLGI SPLLLPEFTI TLLNYIGLYA VVAVGLVLLT GVGGLTSFGQ
AAFVGLGAYT TAWLTTVYGL SPWLTLFIGM AITAAVALSL GFITLRMGGH YLPLGTIAWG
ISLYFLFGNV EFLGGHTGIT GIPAVSLFGW ELKSGREFFY LIWLVVLLAI VSIRNLLDSR
VGRAIRALKG GTVMAEAMGV NTAREKIIIF LIAALLASIS GWLYAHLQRF VNPTPFALNQ
GIEYLFMAVV GGIGHVWGAV LGAGVITILK QWLQDLLPQL LGQSGNFEVI VFGIAMVLIL
QKARAGLWPV ILRLLPSRSI IREVPQAEAL PKRQPTAAGS LLLEASEVTK RFGGLVANNN
MSLTVQAGEV MALIGPNGAG KSTMFNCISA VNPATEGKIA FLGESTAALA ARDIARRGMS
RTFQHVRLLG NMSVLENVAI GAHLRGSKGV LAAALRLDRA EENRLLAEAA RQIERVGLAE
HMFDAAGSLA LGQQRIVEIA RALASDPCLL LLDEPAAGLR YKEKQALAEL LRKLRAEGMG
ILLVEHDMDF VMGLADRVVV MEFGEKIAEG LPEQVQQDPK VLEAYLGGVE