Gene Noca_2951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2951 
Symbol 
ID4595735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3136463 
End bp3138217 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content74% 
IMG OID639777556 
Productmajor facilitator transporter 
Protein accessionYP_924140 
Protein GI119717175 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.380575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGTGA GGTCCCCACG GGTCCTGCTC TCGCTGGCCG CGGTGGCCGT GGCCTTCGCC 
GCGGCCGACA CCTACGTCGT CGTGCTGGCG CTGCCGGACA TGATGGCGAG CACCGGGATC
CCCGTGGACC AGCTGCAGCG CGCGGCGCCG ATCATCTCCG GCTTCCTGCT CGGATACGTC
GCGATGCTGC CGCTGATCGG CCGGATCGCG GACCTGCGGG GCCGGGTGCC GGTGCTCGTC
GCCGCGCTCC TGGTCTTCGC AGTCGGCTCC CTGGTCACCA CGCTCGCCTA CGACCTGCCC
TCGATGGTCT CGGGACGGTT CCTGCAGGGC GTGGGCGGTG GCGGCCTGGT CCCGGCGACC
CTGGCCCTCG TCGCCGACCT GTACCCCACC GAGCGCCGCG GGGTGCCGCT CGGGGTCGTC
TCGGCGGTGC AGGAGCTCGG CAGCGTGCTC GGGCCGCTGT TCGGGGCACT GGTGCTCGCG
GTCGCCGACT GGCGAGCCAT CTTCTTGGTC AACGTCGCGG TCGGACTCGT CCTGGCCGCC
GCGATCCGCA AGCTCGCCGG GCCACGGGGT GGTCTCGACA CGCCGTCGCT CGCGACCCAC
TTCGACTGGC TCGGGCTGGC GCTGGCCGTG GGGACCCTGG TCTCGGGCGC GCTCGTGTTC
GTCGAGCCCA GCCAGCTCAT GCGCGACCTG ACCTGGGGCC GGCTGTTCGT CCCGGTGACG
GGGTCCGGCC GCTGGCTGAC CCCGCTCGGG CTGACCGCGA CGGGCGGCCT GCTGCTGCTC
GCCGTGCACT GCTCGACGGC GCGCCGGCCG CTCCTGGACG TGCGCGCCTG GGTGCGCACC
CTGCGCGCGG CCGACCTGCT CGGCTCGCTG TTCCTGGGGC TCGCGCTCGC CGGGGTGATC
CTCGCGTTCG CGACCGCCGA CCCCAAGGTG CAGGTCTTCT CCGACCAGGG CCTGTGGTAC
CTCCTCGCCT CCGCCCTGGC CGCCGTCGCC TTCGTGCTGC ACCTGCGCCG CACACCGGCG
CCCCTGGTCC CGGCCGGCAC GCTCCGGCGT ACCCCCGCGT GGGGCGCGAT GGCGGTGAGC
TTCTTCATCG GGGCCGCGCT GATCGCCGCC CTGATCGACA TCCCGCTGTT CGCGCGCACC
ACCGTCTACC CCGACTCCCA GCTGATGGCG GCGCTGGTCC TGGTCCGGTT CCTCGTCGCG
CTCCCGCTCG GCGCCGTCGT CGGGGGGTAC CTCACCCGCA CGCTCAGCGC CGGGCTGGTG
ACCGCCATCG GGATGGCCTG CGCGGCGGCC GGCTTCCTCT GGATGGCGAC CTGGGACCTC
GAGAGCCTCG ACCGGGCGAG TGCCACGGTG CCGCTGCTGC TCGGCGGCCT CGGGTTCGGG
CTCGCCCTCG CACCGGTGAA CGCCGCGGTG CTCGCCAGCA CCGACGACGA CGTGCACGGG
CTGGCCAGCG CGTTCGTCGT GGTCGCCCGG ATGGTCGGCA TGCTGGTCGG GATCTCCGCC
CTCACCACGA TCGGCCTGCG CCGCTACTAC GCCGAGCAGG CCGACCTGCC GCCCGCTCGC
GAGGTGTGCG ACGGGCGGAC CCGCTGCTCG GAGTTCACCG ACCTGCTCAA GGTCGCGGGC
ATCGCCCAGG AGCACACCGT CTTCACCGGC GCCGCGGTCT GCGCGGTCGG CGCGGCAGTC
CTCGCCCTGG TGCTGTTCCG GGGTGCGGCG ACCCGGGCGA TCTCGACCGG CGACCTGCTC
CGCGCCACCG GGTGA
 
Protein sequence
MTVRSPRVLL SLAAVAVAFA AADTYVVVLA LPDMMASTGI PVDQLQRAAP IISGFLLGYV 
AMLPLIGRIA DLRGRVPVLV AALLVFAVGS LVTTLAYDLP SMVSGRFLQG VGGGGLVPAT
LALVADLYPT ERRGVPLGVV SAVQELGSVL GPLFGALVLA VADWRAIFLV NVAVGLVLAA
AIRKLAGPRG GLDTPSLATH FDWLGLALAV GTLVSGALVF VEPSQLMRDL TWGRLFVPVT
GSGRWLTPLG LTATGGLLLL AVHCSTARRP LLDVRAWVRT LRAADLLGSL FLGLALAGVI
LAFATADPKV QVFSDQGLWY LLASALAAVA FVLHLRRTPA PLVPAGTLRR TPAWGAMAVS
FFIGAALIAA LIDIPLFART TVYPDSQLMA ALVLVRFLVA LPLGAVVGGY LTRTLSAGLV
TAIGMACAAA GFLWMATWDL ESLDRASATV PLLLGGLGFG LALAPVNAAV LASTDDDVHG
LASAFVVVAR MVGMLVGISA LTTIGLRRYY AEQADLPPAR EVCDGRTRCS EFTDLLKVAG
IAQEHTVFTG AAVCAVGAAV LALVLFRGAA TRAISTGDLL RATG