Gene Noca_4478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4478 
Symbol 
ID4596997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4734002 
End bp4735198 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content76% 
IMG OID639779089 
Productmajor facilitator transporter 
Protein accessionYP_925662 
Protein GI119718697 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGGGGC GGCGCGCCCC GGGCGTGGAC CGGCCACCCG TCGCCCCGAT CCGGCTGCTC 
CAGCTCGGCG CCTTCGTCAG CACGCTGGAC CGGTTCGCGC TGCCGCCGAT GCTGGTCGCG
ATCGCCCACG ACCTGGGAGC CCCGCTCGGC GAGGTCGTCA CGGCGGCCGG CGCCTACTTC
CTGGTCTACG GCCTGAGCCA GCCGGTGTGG GGCACGGTCT CCGACCGGCT CGGCCGGGTC
CGGACGATGC GGATGACGCT GCTGCTGGCG GGGGTCTTCA CCCTCGTCTC GGCGCTGAGC
TGGTCGCCGC TCTCGCTCGG GGTGACCCGC GGCCTCGCGG GCGGGTTCTT CGGCGCGGCG
TACCCCTCGA GCCTGATCTA CCTCGGCGAC ACGGTGCCCG CCCCCTCCCG GCAGCGCGAC
ATCGCACGGC TGATGGTCGG GGTGGCGATG GGCACCGCGC TCGCCTCGGT CGGCGCCGGC
GTGCTCGCCG ACGCCGTGAG CTGGCGGGTG GCGTTCGTGG TCACCGGCAT CGCGTCGCTG
GTGATGACCT GGGCCCTGCG CGGCCTGCCG GAGCCCACCG CACACGGCAG GCCGGCCTCG
GCGATGGACG GGCTGCGCGC GATCAGCGGC GCGCCGATCG CGCTGCTGAT GCTGGTGTTC
GCGTTCACCG AGGGCGCGGT GCTGCTGGGC GCCCTCACGC TGCTGCCGCC CGCGGTCGAG
AACGCCGGCG CCACCGCGGC GCTCGCCGGT GCGGTGACCG CGATCTACGG GGTCTCGGTG
TTCGCCAGCT CGCAGCTGGT CGGGCGGCTC GCCGCGACCT GGCACCCGTC GCGGCTGATC
GCGATGGGCG CAACCGCCGC CGCCGCCGGC TGCGGGCTGC TCGCGGTCTC CCAGGAGCCG
GCCGTCGCAG TGGTGGTCGC CCTGCTCGTC GGCCTGGCCT GGACCTCGAT GCACTCCTCG
CTGCAGACGT GGGCGACCGA GGTGCTGCCG GGCGCGCGGG CGACCGTGGT CTCGTTCTTC
GCCGGGTCGC TGTTCGTCGG GAGCGCGCTG GCCGCGGTGC TGGTCGCCGG CCTCGCGGAC
GCCGGCCGCT ACACGGCGAT CTACGCCGTG TACGCCGCGC TCGCGGTGCC GCTCGGCCTC
GCGGCCGGTC TGGCGCGGCG GCGCTGGGTG CGCCCGGCCG CGGAGCGGGG AACCTAG
 
Protein sequence
MRGRRAPGVD RPPVAPIRLL QLGAFVSTLD RFALPPMLVA IAHDLGAPLG EVVTAAGAYF 
LVYGLSQPVW GTVSDRLGRV RTMRMTLLLA GVFTLVSALS WSPLSLGVTR GLAGGFFGAA
YPSSLIYLGD TVPAPSRQRD IARLMVGVAM GTALASVGAG VLADAVSWRV AFVVTGIASL
VMTWALRGLP EPTAHGRPAS AMDGLRAISG APIALLMLVF AFTEGAVLLG ALTLLPPAVE
NAGATAALAG AVTAIYGVSV FASSQLVGRL AATWHPSRLI AMGATAAAAG CGLLAVSQEP
AVAVVVALLV GLAWTSMHSS LQTWATEVLP GARATVVSFF AGSLFVGSAL AAVLVAGLAD
AGRYTAIYAV YAALAVPLGL AAGLARRRWV RPAAERGT