Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4322 |
Symbol | |
ID | 4596840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4571149 |
End bp | 4572405 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639778932 |
Product | major facilitator transporter |
Protein accession | YP_925506 |
Protein GI | 119718541 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.369395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCGT ACCGGACCCT CGCCCACAAC CGGGACTTCA CCGTGCTGTG GTGCGCCCAG ACCATCTCCG AGCTGGGCTC GCGGGTCAGC AGCTTCGCGA TGCCGCTGGT CGGGTACGCG ATGACCGGCT CGGCGTTCTG GGCCGCGGCC GCGGAGGCCG CGTACCTGCT CGGGATGGTC GTGATGCTCC TGCCCGGCGG GGTGCTCGCC GACCGCTGCG ACCGGCGCCG GCTGATGCGG CTCTCGCACG GCGGCGGCGC GCTGCTGTAC GCCTCGCTGG TCACGGCGGG CATGCTCGAC GTGCTCACCC TCCCCCACCT GCTCCTCGTG GCGCTGCTGA CCGGCCTCGC GGCCGGCCTC TTCGTCCCGG CGGAGGGCTC CGCGATCCGC ACGGTGGTGG CGGCCGACGA CCTGCCCACC GCCCTGAGCC AGCAGCAGGC CCGCCAGCAC GTCGCCTCGC TGGTCGGTGG CCCGCTCGGC GGCGCTCTCG TCGCCGCCAC CCGATGGGCG CCCTTCCTGT TCGACGCGAT CACCTACGCC GCGGGCTGGG TACTGCTCGG CCGGATCCGG GCCGACCTCT CCGGCCGACC GCAGGCCGGC ACGGGGCGCG CCCTGCACGA CCTGGGCGTG GGCCTCCGGT TCACCTGGTC GAGACCGTTC TTCCGCGTGC TGCTGCTGTG GTCGCCCCTG ATCAACCTGA CCGTTAACGC CCTGTTCTTC GTCGCGCTGC TGCGGCTGGT CGAGGCCGGC TTCCCCGCCT TCCAGATCGG GCTCGTGGAG GCGACGATCG GCAGCTGCGG CATCCTCGGC GCGCTGGCCG CGCCGTGGCT GATCGACCGG CTGGCGACCG GGACGCTGAC CGTGGCCGTC GCCTGGAGCT TCGTCCCGCT CTCGGTGCCG CTGGCCCTCT GGAACCACCC CGTGGTGATG GCGGCGGCCG CCTCGGTGGG GCTGTTCCTC AACCCCGCGG GCAACGCCGG TGTCGGGTCC TACCGGATGG CGGTCACCCC GTCGGAGCTG GTCGGCCGGG TGCAGTCGGC GATGCAGTTC ACCTCGATGC TCTCCATGCC GCTGGCGCCC GCGCTCGCGG GCGCGCTGCT CACCGGGCTC GGCGGGCCGG CGGCGGTCCT CGCGCTGACC GGACTCACCG CTGCGGTCGC CCTGATCCCC ACCCTGTCCA CCTCCGTCCG CTCGGTCCCC CGCCCGGCCG ACTGGCCGCG CTACGAGACG CCCATCGTGG CCTCCGCCGC CGCCTGA
|
Protein sequence | MTSYRTLAHN RDFTVLWCAQ TISELGSRVS SFAMPLVGYA MTGSAFWAAA AEAAYLLGMV VMLLPGGVLA DRCDRRRLMR LSHGGGALLY ASLVTAGMLD VLTLPHLLLV ALLTGLAAGL FVPAEGSAIR TVVAADDLPT ALSQQQARQH VASLVGGPLG GALVAATRWA PFLFDAITYA AGWVLLGRIR ADLSGRPQAG TGRALHDLGV GLRFTWSRPF FRVLLLWSPL INLTVNALFF VALLRLVEAG FPAFQIGLVE ATIGSCGILG ALAAPWLIDR LATGTLTVAV AWSFVPLSVP LALWNHPVVM AAAASVGLFL NPAGNAGVGS YRMAVTPSEL VGRVQSAMQF TSMLSMPLAP ALAGALLTGL GGPAAVLALT GLTAAVALIP TLSTSVRSVP RPADWPRYET PIVASAAA
|
| |