Gene Noca_0030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0030 
Symbol 
ID4598384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp33397 
End bp34605 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content75% 
IMG OID639774645 
Productmajor facilitator transporter 
Protein accessionYP_921267 
Protein GI119714302 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.63894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAGG GCTCCCGCGC GGTGCTGCCG CGACCCACCG GCGGCTTCGA CCGCGCATCG 
ATGAGCGCCT CCCTGGCCGG GGTGGCGCTC AGCGTGCTTC CCGTCTACCT CCTCGGCGGA
CTCACGGTCT TCATCGCCGA CGACCTCGAC TACGGGGTGG TCGGACTCGG GGCGGCGGCC
TCGGCGTTCT ACGCGGCCTC CGCGGTCGCC TCGATCCCGG CCGGCCGAGT CGCCCACCGG
CTGGGAGCCG AGCGGTCCTT GATCCTCGGC GTGTGCGTGT CGTCCGCGGC GCTGCTCGCG
GTCGCCCTGG TCGTGCAGGC CTGGTGGAGT CTGTGTGTGG CCCTGGTCGT CGGCGGCGTG
GCGACCGCGC TCGTCCAGCC GGCCGCCAGT GCGCTCGTGG CACGGGACCA GCCCGACAGC
CGCCACGGCC TGAGCTTCGG CCTGCTCCAG ACGGCCGTCC CGGTGGCCAC CCTCGCTGCC
GGACTGGCGG TGCCGGTGAT CGGGGCCACG TGGGGCTGGC GATGGGCCTA CGGCCTCCTG
GCGCTGGGCG CCATCCCGAT CGTCGCGTTC GGGCTGCGCC GGGCACCGCG CCGTCTGGAG
CGGCGAGCCA ACCCTCCGAG CGCGTCCCGG CCGATCCGCG GCGCCGCGAT CGTCCTGCTC
ACGCTCGCCG GCGCCTGCGC GGCCGCGCCC GCGAACGCCC TCGGCGCCTA CTACGTCGAG
TCCGCGGTCG CCGCCGGCAT CTCCCGGTCC GGCGCGGGTC TGTGGCTGGT GGCCGGCAGC
GCCGTGGGTA TCGGGGCGCG CCTGCTGTGG GGTTGGATGA TCGACCGCAG GGGGGCCGAC
CCGCGCCTGT GGATCGCGGG CCTGATGCTC AGCGGAGGCG TCGGCTTCGC GCTGCTCAGC
GTGACCCACA CCGCTCCCGT CCTGTTCGCC GCCACCGCGC TGACCTTCGG GGCCGGGTGG
GCGTGGAAGG GTCTGTACAA CCTCGCGGCG ATCCGGCACG ACCCCGAGCA GCCGAGCGCG
GCCGTCGGCA TCGCGCAGTT CGGGGTCTAC GTCGGGAGCG TCCTGGGCCC GATCGCTTTC
GGCGTCCTCC TGGCGCACAG CTCCTACGCC GTGGCCTGGG CGGCCGCCGG GGTCACCTGC
CTGCTGACCC CGGTGCTCAT CTGGGTCCAG CGCCGGCTCG GTCGGCCGGC CGCCGGGCAC
GGTATCTGA
 
Protein sequence
MAEGSRAVLP RPTGGFDRAS MSASLAGVAL SVLPVYLLGG LTVFIADDLD YGVVGLGAAA 
SAFYAASAVA SIPAGRVAHR LGAERSLILG VCVSSAALLA VALVVQAWWS LCVALVVGGV
ATALVQPAAS ALVARDQPDS RHGLSFGLLQ TAVPVATLAA GLAVPVIGAT WGWRWAYGLL
ALGAIPIVAF GLRRAPRRLE RRANPPSASR PIRGAAIVLL TLAGACAAAP ANALGAYYVE
SAVAAGISRS GAGLWLVAGS AVGIGARLLW GWMIDRRGAD PRLWIAGLML SGGVGFALLS
VTHTAPVLFA ATALTFGAGW AWKGLYNLAA IRHDPEQPSA AVGIAQFGVY VGSVLGPIAF
GVLLAHSSYA VAWAAAGVTC LLTPVLIWVQ RRLGRPAAGH GI