Gene Noca_3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3966 
Symbol 
ID4598101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4179789 
End bp4183016 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content73% 
IMG OID639778571 
Productmajor facilitator transporter 
Protein accessionYP_925150 
Protein GI119718185 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGTACAG GTGGACGTGG GATGGTCGGT CGGTCACCAG GAGCCGCCGC CCCCGCCGCC 
GCCTCCGCCG CCACCCGAGA ACCCGCCGCC GCCCGAGGAC GACGACTTCT GGGTGGCCTC
GTAGGAGGAG ATCGCGGAGC TGACGGTGGA GTGGAAGTCG TCGACCATCG ACGAGGCCAC
CGAGCCGGTG TAGGCGCCCG CGTACGCGGG ACCGAGGTAC GACGGGACCG GCGGCTCGGC
GCCCATCTCG GTGCGGTACT TGCCCGCCCA CTCCTCGGCG CACCCGAACG CCACCGCCCA
GGGCACGTAG GCGGTGTAGA ACTCCTCGCG TCCGGAGAAG TCGAACCGGT CCTGTGCGGA
CGGCGTCGCC AGGATCCGGC GGAACCCGCC GGCCCGCGAC CAGAGGTCGC GGCCGGCCTT
GGTGCGCCGG GTGCCCGCGC CGGAGGCCGC GAGGGAGGCG CCGAAGACCC CGAAGGCGCC
CGGCACCAGC CCCACGATCG ACATGTTCAG CGGGTTCCAG ATCGCGATCG CGACCACCGC
GGCGAGGCAG AGGCCGATGA CCACTCCGCC ACCGCCGCCC AGCCCGCTCC TGACCATCAG
GCCGGAGGTG GCGGCCCAGG TCCGGGTGCT CTCCTCGAAC GACTCGATCT CGGTCTTGAG
CCGCTGACCG GCCGCGACGT CCTTGGGCGC GGCGACGAAC GAGGTGCCCG GGCCGCCGAG
CAGGTGTGCC ACCCCACTGG TCACGGGATC GAGGCCGGCC CAGCCCGCTG CGCCGTTCTT
GTCCGAGATC GTCCACGTGC CGTCGGCGCG GTCCAGGTCG ACGGCACCCT TCTCGGCGGC
GTACATGAGG GTGGCGACGT ACTGCTCGTC GTCGACCTCC TCGGTGACCA GGTAGGCGGC
CTGGGCGGGG CCGATGCCGT CCGGCGGCGC GTACATCACC GGGAACTGCG GGTCCTTCTC
CCTGGCCCGC CGGGCCAGGT GGAAGCCGTA GCCGCCGGCC GCGGCGGCCA GGACGAGCAC
GACGGCCAGC GCGGCCAGGT TGGGCCCGAG CACGGCGTCC AGCGCCGGGC CCCAGGGTCG
CTCGTCGCCG GGCGGAGGCG TTGCCAGGTC CAGGCCCACC TTGACCGTCA CGGGGGTGCG
CGGGTCGAGC GACGTCGCCC GGATCCGCAG GTCCGCGGTG CCCTCCCCTC GGAGCCGGCA
GCCCGTCTCG TCGGAGCCGA CGGCGCACTG GACCCGCTCG GCCGCGGCGG GCAGGTGGAC
CGTGAGGTCG GCCGAGTCGA TGCGCTGGGC CCAGCCGCTG GGGATCAGGT TCCAGTAGAG
CTGGGTGCGG GAGCCGTTGG TGCCCGGCTC CAGGATGCCG GCGATGTCGT AGCGGATCTC
GTACACGTGC TCGCCGGGCT CGACGGTCGA GTCCGGATCG CCGATGCGGG CGACCTTGAA
CTGGCCACCG CCCTCGTAGG ACGTCTCGAC CGGGACGTCG GCACCGTCCA TCGTGACCTC
GATGCCGCGG GGGATCCGGC GGACGGTGTC CGGGGCCGTC TGGTCGTGGG TGTCCCAGAA
CCGGAAGATC CCGTGCTTGC CGGGGAACGG GAAGTCGACC GTGAGGGTCT CGACGGCCGT
CAGGTCGCCC TGGTCGTCCA CGTCGAAGTC CGCGACGTAC GACGTGATCG TGGTCTCGTC
CGACTCGGCG GGTCCCTCGT CGCCGCTCAC CCCGTAGAGC GCCGCGGGGA GCATCAGCAC
CAGCACCACG ACCGCCAGGC CGACCACGGT CCCCACCACA CGCTTCATGG CCCGGAGCCT
AGGGGGCGAC GGGACAGCCG CGTCGGAAAA CCGCGTGCCG GGTGTCGCCG GCCGCTGGGA
GACTGCTCGG CGATGAGCTC GCCGACCCGC CCCGACGCCG ACCGCCCGAC CCTCGCCTCC
TTCTGGCACG ACCTGCCCCG CGAGGGCCGG CTGCTGCTCT CGGTCGTGGT CTTCGAGTTC
ATCGGCACCG GCCTGGTGCT GCCCTTCCAC GTCGTCTACC TGCACGAGGT GCGCGGCTTC
GCGCTCAGCG ACGTCGGGCT GCTGCTCGCG CTCCCGCCCC TGATCGGCTT CCTCGTCGTC
GGGCCCGGCG GTACCGCGAT CGACCGGCTC GGCGCGCGCC GGATCCTGAT CGGCGCGCTG
GTCCTCCAGA CCGTCGCGAA CGTCACACTC GCGTTCTCGG CGGCGGAGTG GATGGCGGCG
GGGGGGCTGA TGCTCTCCGG CGCGGCGTTC GGGGTGTCGT GGCCGGGGTT CCAGGCCTTC
ATCGCCGCGG TGGTCCCGGT CGAGCTGCGG CAGCGCTACT TCGGCGTGAA CTTCACGCTG
CTCAACCTCG GCATCGGGAT CGGCGGCATC GTCGGCGGCG CGTTCGTCGA CGTGGACCGG
CTGGTCACCT TCCAGGTCAT CTACCTCGGC GACGCGATCA GCTACCTCCC CGCCCTGGTC
CTCCTGCTCT GGCCGCTGCG GCTGGTCGCC GGTCGGCCGG TCCACGAGGG CGGCGCCCCG
CCGGCGACGG TGAGCTACCG CGAGGTGATG CGTCGGCCCG CGGTCGCCTC GCTGATGCTG
CTCAGCTTCG TGTCGTCGTA CGTCGGCTAC TCCCAGCTCA ACGCCGGGAT GCCGGCGTTC
GCGCGCGCGG TGGGCGAGGT CTCGACGCAG GGCCTCGGGC TGGCGTTCGC CGCGAACACC
GTGGTGATCG TCGTGCTCCA GCTGGTCGTG CTCCAGCGGA TCGAGGGGCG GCGCCGCACC
CGGGTGATCG CGGTGATGTC GGTGGTCTGG GCGTGCTCCT GGGTGCTGCT CGGCGCCACC
GGGCTGGTCT CCGGCACGTG GGGCGCGACG CTCCTGGTCG CCGGCTGCGC GTCGGTGTTC
GCGTTCGGCG AGACCCTGCT GCAGCCGACC GTCCCCGCCC TCGTCAACGA GCTGGCCCCC
GACCACCTGC GGGGGCGCTA CAACGCGCTC AGCTCCGGGT CCTTCCAGCT CGCCGCGATC
ATCGCACCGC CGGTCGCCGG CTACCTCGTC GGCCACGGCC TGGGCAGCGT CTACATCGGC
TCGCTCGTCG TCGGCTGCCT GCTCTGCGGC GCGCTGGCGG TCCTGCGGGT CGAGCCACAG
CTGAGCCCCG AGGTCAACGG TGTGCGAGCT CCGGCGCAGG TCACCACGGC CGCGGACGTC
ACCGTCCCCG TGCCGACGGC CAAGACCCAG TCCAGCGCCC TGGACTAG
 
Protein sequence
MGTGGRGMVG RSPGAAAPAA ASAATREPAA ARGRRLLGGL VGGDRGADGG VEVVDHRRGH 
RAGVGARVRG TEVRRDRRLG AHLGAVLARP LLGAPERHRP GHVGGVELLA SGEVEPVLCG
RRRQDPAEPA GPRPEVAAGL GAPGARAGGR EGGAEDPEGA RHQPHDRHVQ RVPDRDRDHR
GEAEADDHSA TAAQPAPDHQ AGGGGPGPGA LLERLDLGLE PLTGRDVLGR GDERGARAAE
QVCHPTGHGI EAGPARCAVL VRDRPRAVGA VQVDGTLLGG VHEGGDVLLV VDLLGDQVGG
LGGADAVRRR VHHRELRVLL PGPPGQVEAV AAGRGGQDEH DGQRGQVGPE HGVQRRAPGS
LVAGRRRCQV QAHLDRHGGA RVERRRPDPQ VRGALPSEPA ARLVGADGAL DPLGRGGQVD
REVGRVDALG PAAGDQVPVE LGAGAVGARL QDAGDVVADL VHVLAGLDGR VRIADAGDLE
LATALVGRLD RDVGTVHRDL DAAGDPADGV RGRLVVGVPE PEDPVLAGER EVDREGLDGR
QVALVVHVEV RDVRRDRGLV RLGGSLVAAH PVERRGEHQH QHHDRQADHG PHHTLHGPEP
RGRRDSRVGK PRAGCRRPLG DCSAMSSPTR PDADRPTLAS FWHDLPREGR LLLSVVVFEF
IGTGLVLPFH VVYLHEVRGF ALSDVGLLLA LPPLIGFLVV GPGGTAIDRL GARRILIGAL
VLQTVANVTL AFSAAEWMAA GGLMLSGAAF GVSWPGFQAF IAAVVPVELR QRYFGVNFTL
LNLGIGIGGI VGGAFVDVDR LVTFQVIYLG DAISYLPALV LLLWPLRLVA GRPVHEGGAP
PATVSYREVM RRPAVASLML LSFVSSYVGY SQLNAGMPAF ARAVGEVSTQ GLGLAFAANT
VVIVVLQLVV LQRIEGRRRT RVIAVMSVVW ACSWVLLGAT GLVSGTWGAT LLVAGCASVF
AFGETLLQPT VPALVNELAP DHLRGRYNAL SSGSFQLAAI IAPPVAGYLV GHGLGSVYIG
SLVVGCLLCG ALAVLRVEPQ LSPEVNGVRA PAQVTTAADV TVPVPTAKTQ SSALD