Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daro_0860 |
Symbol | |
ID | 3569847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | + |
Start bp | 931501 |
End bp | 932688 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637679318 |
Product | MFS family nucleoside/H(+) symporter |
Protein accession | YP_284086 |
Protein GI | 71906499 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00000000354031 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000116391 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATGCTT TGCCTTACTG GCGCCTCTCG GGCTATTACT TTTTCTATTT CGCCTTCATT GGCGCGTTTT CGCCCTATTT CGGGCTCTAT CTCCAATCCC TGAGTTTTTC TGCCTGGGAC ATCGGACTGT TGATGTCGCA GATGCAGCTG ATGCGTCTGT TTGGCCCCAA TCTGTGGGGG TGGCTGGCAG ATCGTTTCGG TCGGCGCCTA GCGATCATTC GTCTGGCGGG GCTGATTGGC CTGGCCGGAT TTACCGCGTT TTTCTGGCTC GATAAATTAC CTGCCATGCT GCTTGCGATG GCAGTTTTGG CATTTTTTTG GAGTGCCGCG CTACCTCTGG TTGAAACGCT GACCTTCGAT CATCTGCGTG ACGAGCGTGG GCGTTACAGT CGTATCCGTC TTTGGGGTTC GATCGGGTTC ATCATTGCCG TCATGGCGAC CGGGGCGCTA CTCGATATTG CGCCGCCGGT GGGCGTGCTC TGGGTGTGCT GGGTGATCCT GGTTGGCATT CTGCTCTATG CGCTGACCTT GCCCGAGTCG CCACCGCTTC CGCACCCCCG AGACGCTCAG CCGATTGGCG CCATTCTCCG TCAGCCGAAA GTGTTCGCGC TGATGGCCGC CTGTTTTGCC ATGTCGGCAG CGCACGGCGC CTTCTATGTC TTCTATTCGA TCCATCTCGA TGCCCATGGC TACACCAAGA CAGAGGTTGG GCTGCTTTGG TCGCTCGGTG TGGTGGCGGA AATCGTCGCT TTCATGTGCA TGGCGCGCCT GGCGAAGCGT TTTTCGCTAC GAACCATTCT GCTGGCCTGT TTTGCCGCCG CCGTCGTTCG TTTTTTGCTG ATGGGCTGGG GCGTCGAGTC TGCCGCTATC ATGATTTTTG TCCAGTTGCT GCATGGTCTG ACCTTTGGGG CCTATCACGC ATCGGCAATC GCCGCAGTCA ATCTATGGTT TCCCGGCAAG ACTCAGGGGC GGGGGCAGGC GCTATATTCC AGCCTGTCTT TCGGTGCCGG TGGTTTGCTC GGCGCCTTGA TCAGTGGTCG GACCTGGGAC TGGCTAGGCT CCGGTTGGAC TTTCACGCTT GGCTCAGTTT TCGCGCTGAT TGGCTTGTTT CTGATCTGGG GCTGGGTTGG CGGTAATGCG GTGCTTGATG CCCCGGAACG GGCAAAAACC GGCGATCCTG TGCAATAA
|
Protein sequence | MHALPYWRLS GYYFFYFAFI GAFSPYFGLY LQSLSFSAWD IGLLMSQMQL MRLFGPNLWG WLADRFGRRL AIIRLAGLIG LAGFTAFFWL DKLPAMLLAM AVLAFFWSAA LPLVETLTFD HLRDERGRYS RIRLWGSIGF IIAVMATGAL LDIAPPVGVL WVCWVILVGI LLYALTLPES PPLPHPRDAQ PIGAILRQPK VFALMAACFA MSAAHGAFYV FYSIHLDAHG YTKTEVGLLW SLGVVAEIVA FMCMARLAKR FSLRTILLAC FAAAVVRFLL MGWGVESAAI MIFVQLLHGL TFGAYHASAI AAVNLWFPGK TQGRGQALYS SLSFGAGGLL GALISGRTWD WLGSGWTFTL GSVFALIGLF LIWGWVGGNA VLDAPERAKT GDPVQ
|
| |