Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3926 |
Symbol | |
ID | 4598061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4132924 |
End bp | 4134297 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639778532 |
Product | major facilitator transporter |
Protein accession | YP_925111 |
Protein GI | 119718146 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCCCA CGTTCCGCTC CCTGCGCAAC CCGAACTACC GGCGCTACCT CGCCGGCAGC CTGGTGTCCA ACACCGGGAC CTGGATGCAA CGGGTGGCCC AGGACTGGCT GGTGCTCCAG CTGCCGGGCA ACAGCGGCAG CGAGCTCGGG ATCACCACCG GGCTGCAGTT CCTGCCGATC CTGCTGCTGA GCCCCTACGC CGGGGTCGTC GCGGACCGGT TCCCGAAGCG CCGGCTGCTG CAGGTCACCC AGGCGACGAT GGCGCTCGCG TCGCTGGCGC TCGGCCTGAT CGCGGTGCTC GGGGTCGCGC AGACCTGGCA CGTCTACCTG ATCGCGTTCC TCTTCGGCAT CGGCGCGGCC TTCGACGGTC CGGCCCGACA GTCGTTCGTC TCCGAGATGG TCGGCCCCGA GGACCTCACC AACGCGGTGG GCCTGAACTC CGCGAGCTTC AACGCCGCCC GGATCCTCGG CCCGGCCGTC GCCGGCCTGA TGATCGGCGC GCTCGGCGGT GGCGCGCAGG CGACCGGCTG GGTGATCCTC GTCAACGCCG CGTCGTACCT CGCCGTGATC GGCCAGCTCC AGCGGATGGA CGTGGTGCTG CTGCATCCGG CCGAGATCCA GGACCGCACG CCCGGCATGC TCGTCGACGG CGTCCGCTAC GTGCGCAGCC AGCCCAAGAT GGTGCTGATC CTGATCATGG TCTTCTTCGC CGGCACCTTC GGCATGAACT TCCAGATCAC GTCGGCGCTG ATGGCCACCG AGGTGTTCGG CAAGGGCGCG GGGGAGTACG GCGTCCTCGG CTCGGCGCTC GCGGTCGGGT CGCTGACCGG GGCGCTGCTG ACCGCCCGGC GAGTCCAGAT CCGGGTCCGG CTGCTGGTGC TCGCCGCGCT CGGCTTCGGC ACCGCCGAGA TCATCGGTGG CCTGCTCCCG TCGTACCTGT TGTTCGCGCT GTTCTCACCG GTCATCGGGT TCTTCACGCT GACCCTGCTC AGCTCGGCGA ACGCCACCCT CCAGCTGGAG GCCGCCCCGG CGTTCCGCGG CCGGGTGATG GCCCTCTACA TGACCATCCT GATGGGCGGC ACCCCGATCG GCGCGCCCAT CATCGGCTGG GTCGCCCAGC ACCTCGGTGC CCGGTGGGGC CTGATCATCG GCGGGACGCT GACCATCCTC GGCGTACTGT TGGCGCTGGC CGCCCATTCC CGCCTCCGCG GCGGGGTGCG CACCGTTTTG ACCGAGGTAG ACCACCCGGG TAACCTTTTT CCTCGTGTCT GGGACAACCA GGCCGTCGCG CGTGCCCGGA AGCAGTCGGG GAGTCAGGTC CTCGGATCCG GCACCGAAAC TCCAGCAGCG GAAGCCCTCA CCGGTTCTCG CTGA
|
Protein sequence | MSPTFRSLRN PNYRRYLAGS LVSNTGTWMQ RVAQDWLVLQ LPGNSGSELG ITTGLQFLPI LLLSPYAGVV ADRFPKRRLL QVTQATMALA SLALGLIAVL GVAQTWHVYL IAFLFGIGAA FDGPARQSFV SEMVGPEDLT NAVGLNSASF NAARILGPAV AGLMIGALGG GAQATGWVIL VNAASYLAVI GQLQRMDVVL LHPAEIQDRT PGMLVDGVRY VRSQPKMVLI LIMVFFAGTF GMNFQITSAL MATEVFGKGA GEYGVLGSAL AVGSLTGALL TARRVQIRVR LLVLAALGFG TAEIIGGLLP SYLLFALFSP VIGFFTLTLL SSANATLQLE AAPAFRGRVM ALYMTILMGG TPIGAPIIGW VAQHLGARWG LIIGGTLTIL GVLLALAAHS RLRGGVRTVL TEVDHPGNLF PRVWDNQAVA RARKQSGSQV LGSGTETPAA EALTGSR
|
| |