Gene Noca_2936 details


Gene Information

Locus tag: Noca_2936
Symbol:
ID: 4597437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3117134
End bp: 3118660
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 74%
IMG OID: 639777541
Product: sulphate transporter
Protein accession: YP_924125
Protein GI: 119717160
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
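
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below checks them against each other. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3117134, 3118660)
print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the listed 1527 bp and 508 aa.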


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.304308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCCAC GGATCGTGCC ATCGGCGCGG GGTGACGTCG TCGCCGGTGT GACCGTCGCG 
CTGGTGCTGG TCCCGCAGGC GCTCGCCTAT GCGACGATCG CCGGACTCGA TCCCGTCTAC
GGCCTGTACG CCGCCGTGGC CGCGCCCATC GCCGGGGCGC TGGTCGGGTC CTCCCCGTAC
CTGCAGACCG GCCCGGTCGC GGTGACCAGC CTGCTGACGT TCGGCGCACT GGAGCCCCTC
GCGCGACCGG AGACGCTGCG CTTCGCCGCG CTCGCGGCCG TGCTCGCCGT CCTGGTCGGC
ATGGTGCGGG TCCTGCTCGG CCTCCTCGGC GGCGGCCCGA TCGCCTACCT GATGTCCCAG
CCCGTCGTGG TCAGCTTCAC GACCGCGGCG GCGCTCCTGA TCATCGGGAC CCAGGTGCCG
GCCCTCCTCG GGATGCAGGG CGACTCGGCC AACCCCTTGG TGGGCGCCGT CCGGGCGCTG
GCCGACCCCG CAGCGTGGAG CTGGACCGAC CTGGTCGTCG GACTGGTCGC GATGACGCTG
ATGCTCGGCA GCCGGCGCGT CTGGTCGCTG TTCCCCGGCG CCCTGCTGGC CGTCGTCCTC
GCCGTGGTCT GGAGCCGGGC GACGGGCTAC GACGGGCGCA CGGTCGGCGC GGTGGACCTC
TCGTACAGCA CTCCCCAGGG CGTGTCGGCC ACCGATCTCG CGACCCTCCT GGTGCCCGCC
CTCGTCATCG CGATCGTCGG CTTCGCCGAG CCGGCCTCCA TCGCTCGTCG CTACGCCGCC
GCGGACCGGC AGCCGTGGAA CCCGAACCTT GAGTTCGTCG GGCAGGGGCT GGCCAACCTC
GCGTCCGGCG CGGCCGGCGG GTTCCCCGTC GGCGGCTCGT TCTCGCGCAC CAGCCTGAAC
CGGCTCAGCG GGGCTCGGAC CCGGTGGAGC GGCGGCATCA CCGGGCTGGT GGTCCTGGCC
ATCCTCCCGT TCGTGTCCGT GCTGTCCGCG CTGCCGCTCG CCGTCCTGGC CGGCCTGGTG
ATCGGGGCGG TCGCCTCCCT GGTCGACGTG CGGACGCCGC TGCTCTACTG GCGCTGGTCG
AAGCCGCAGT TCTCCGTGGG GGTGCTCACC GCGGTCGCCA CGATGGCCCT GGCGCCCCGG
GTCGAGCGGG GTGTCCTGGT CGGTGTCGCG GCCGCGCTGG CGGTGCACCT GTGGCGCGAG
ATGGGGGTGC ACCTGCCCGC CTTCGTGGAG GACGCGACCC TGCACCTGCG GCCGACCGGC
GTGCTCTACT TCGGCTCGGC TCCGGCTCTC GAGAGGAGCA TCTCGAGGCT GATCGCGGAG
CACCCCTCCG TCGACCGGGT GGTGCTGCAC CTGGACCGGA TCGGCCGGCT CGACCTCACC
GGCGCGCTGA TGCTGCGCGA CATCCTCGCC GACGCGGAGA GCGCCGGGCG CACGTTCGAG
ATCCGGGGTG CTCGCGCACA CGCCGCGGGA CTGCTGGTGC GGTTGCTGGG GCCGGAAGCA
CGCATCTGCG GTGACGACGT GGCCTGA
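
The GC content reported in the gene information is the percentage of G and C bases in a sequence like the one above. A minimal sketch of that calculation follows. It assumes the sequence is pasted as text in the 10-base blocks shown here; the helper names are hypothetical, and how the database rounds its figure is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence above.
print(gc_content("ATGCGCCCAC"))
```

Passing the full gene sequence text to `gc_content` gives a value to compare with the listed figure.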
 
Protein sequence
MRPRIVPSAR GDVVAGVTVA LVLVPQALAY ATIAGLDPVY GLYAAVAAPI AGALVGSSPY 
LQTGPVAVTS LLTFGALEPL ARPETLRFAA LAAVLAVLVG MVRVLLGLLG GGPIAYLMSQ
PVVVSFTTAA ALLIIGTQVP ALLGMQGDSA NPLVGAVRAL ADPAAWSWTD LVVGLVAMTL
MLGSRRVWSL FPGALLAVVL AVVWSRATGY DGRTVGAVDL SYSTPQGVSA TDLATLLVPA
LVIAIVGFAE PASIARRYAA ADRQPWNPNL EFVGQGLANL ASGAAGGFPV GGSFSRTSLN
RLSGARTRWS GGITGLVVLA ILPFVSVLSA LPLAVLAGLV IGAVASLVDV RTPLLYWRWS
KPQFSVGVLT AVATMALAPR VERGVLVGVA AALAVHLWRE MGVHLPAFVE DATLHLRPTG
VLYFGSAPAL ERSISRLIAE HPSVDRVVLH LDRIGRLDLT GALMLRDILA DAESAGRTFE
IRGARAHAAG LLVRLLGPEA RICGDDVA