Gene Noca_4322 details

Gene Information

Locus tag: Noca_4322
Symbol:
ID: 4596840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4571149
End bp: 4572405
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 74%
IMG OID: 639778932
Product: major facilitator transporter
Protein accession: YP_925506
Protein GI: 119718541
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
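
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4571149
END_BP = 4572405
GENE_LENGTH_BP = 1257
PROTEIN_LENGTH_AA = 418


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 4572405 - 4571149 + 1 gives 1257 bp, and 1257 / 3 - 1 gives 418 aa, matching the reported gene and protein lengths.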


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.369395
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTCGT ACCGGACCCT CGCCCACAAC CGGGACTTCA CCGTGCTGTG GTGCGCCCAG 
ACCATCTCCG AGCTGGGCTC GCGGGTCAGC AGCTTCGCGA TGCCGCTGGT CGGGTACGCG
ATGACCGGCT CGGCGTTCTG GGCCGCGGCC GCGGAGGCCG CGTACCTGCT CGGGATGGTC
GTGATGCTCC TGCCCGGCGG GGTGCTCGCC GACCGCTGCG ACCGGCGCCG GCTGATGCGG
CTCTCGCACG GCGGCGGCGC GCTGCTGTAC GCCTCGCTGG TCACGGCGGG CATGCTCGAC
GTGCTCACCC TCCCCCACCT GCTCCTCGTG GCGCTGCTGA CCGGCCTCGC GGCCGGCCTC
TTCGTCCCGG CGGAGGGCTC CGCGATCCGC ACGGTGGTGG CGGCCGACGA CCTGCCCACC
GCCCTGAGCC AGCAGCAGGC CCGCCAGCAC GTCGCCTCGC TGGTCGGTGG CCCGCTCGGC
GGCGCTCTCG TCGCCGCCAC CCGATGGGCG CCCTTCCTGT TCGACGCGAT CACCTACGCC
GCGGGCTGGG TACTGCTCGG CCGGATCCGG GCCGACCTCT CCGGCCGACC GCAGGCCGGC
ACGGGGCGCG CCCTGCACGA CCTGGGCGTG GGCCTCCGGT TCACCTGGTC GAGACCGTTC
TTCCGCGTGC TGCTGCTGTG GTCGCCCCTG ATCAACCTGA CCGTTAACGC CCTGTTCTTC
GTCGCGCTGC TGCGGCTGGT CGAGGCCGGC TTCCCCGCCT TCCAGATCGG GCTCGTGGAG
GCGACGATCG GCAGCTGCGG CATCCTCGGC GCGCTGGCCG CGCCGTGGCT GATCGACCGG
CTGGCGACCG GGACGCTGAC CGTGGCCGTC GCCTGGAGCT TCGTCCCGCT CTCGGTGCCG
CTGGCCCTCT GGAACCACCC CGTGGTGATG GCGGCGGCCG CCTCGGTGGG GCTGTTCCTC
AACCCCGCGG GCAACGCCGG TGTCGGGTCC TACCGGATGG CGGTCACCCC GTCGGAGCTG
GTCGGCCGGG TGCAGTCGGC GATGCAGTTC ACCTCGATGC TCTCCATGCC GCTGGCGCCC
GCGCTCGCGG GCGCGCTGCT CACCGGGCTC GGCGGGCCGG CGGCGGTCCT CGCGCTGACC
GGACTCACCG CTGCGGTCGC CCTGATCCCC ACCCTGTCCA CCTCCGTCCG CTCGGTCCCC
CGCCCGGCCG ACTGGCCGCG CTACGAGACG CCCATCGTGG CCTCCGCCGC CGCCTGA
 
Protein sequence
MTSYRTLAHN RDFTVLWCAQ TISELGSRVS SFAMPLVGYA MTGSAFWAAA AEAAYLLGMV 
VMLLPGGVLA DRCDRRRLMR LSHGGGALLY ASLVTAGMLD VLTLPHLLLV ALLTGLAAGL
FVPAEGSAIR TVVAADDLPT ALSQQQARQH VASLVGGPLG GALVAATRWA PFLFDAITYA
AGWVLLGRIR ADLSGRPQAG TGRALHDLGV GLRFTWSRPF FRVLLLWSPL INLTVNALFF
VALLRLVEAG FPAFQIGLVE ATIGSCGILG ALAAPWLIDR LATGTLTVAV AWSFVPLSVP
LALWNHPVVM AAAASVGLFL NPAGNAGVGS YRMAVTPSEL VGRVQSAMQF TSMLSMPLAP
ALAGALLTGL GGPAAVLALT GLTAAVALIP TLSTSVRSVP RPADWPRYET PIVASAAA
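
The sequences above are laid out in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch, not part of the source record, of how the gene sequence could be cleaned and checked against the reported length and GC content. The helper names are hypothetical, and the start-codon set is a simplification that lists only the common start codons used with translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Join block-formatted sequence text into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Simplified codon sets (assumption: common starts only).
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """Rough check: whole codons, a start codon first, a stop codon last."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Usage: paste the full block-formatted gene sequence in place of this
# first line of it, then compare the results with the record's fields.
raw_text = "ATGACGTCGT ACCGGACCCT CGCCCACAAC CGGGACTTCA CCGTGCTGTG GTGCGCCCAG"
seq = clean_sequence(raw_text)
gc = gc_fraction(seq)
```

With the complete gene sequence supplied, `len(seq)` can be compared with the reported gene length of 1257 bp and `gc_fraction(seq)` with the reported GC content of 74%.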