Gene Noca_3320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3320 
Symbol 
ID4600212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3528281 
End bp3529327 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content66% 
IMG OID639777926 
ProductABC sugar transporter, periplasmic ligand binding protein 
Protein accessionYP_924509 
Protein GI119717544 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCACA GCAGACTTGC CCGTGGAGCC ATCGTCGGCG CGCTCGCGTT GGCGCTCGCG 
GCGTGCAGTT CGGTCGATGA CAGTGGATCC GACGACAACC CCGGGGCGGC CGGTTCCAGC
GCGTCCGGAG GCGACGGTGC CTACGTCGTC GTCGTGAAGG CGACCGGCAT CGGCTGGTTC
GACCGCATGG AGGTGGGCGT CAAGGAGTGG GCCTCCGAGA CCGGGCTCGA CGCACGTGAA
GAGGGCCCGT CCGAGGCCAC GAACGAGGGC CAGGTCGCGA TCATCCAGGA CCTCATCGCA
CAGAAGCCCA CCGCCATCTC GGTCGTGCCG AACGACCTTG CCGGCCTGGA GAGCGTTCTG
GGCCAGGCCC GCGATGCGGG AATCATCGTC GTCAGCCACG AGGCCGTCGG CATCAAGAAC
GTCGACATCG ACATCGAGGG CTTCGAGAAC ACGGCCTACG GCGCGCAGAT CATGGAGAAC
CTCGGCGAGT GCACGGGCGG CGAAGGCGAA TACGTCCAGT TCGTCGGGCG GCTGACCAAC
GGATCTCACT CGGAGTGGGT CAAGGGCGCC TATGAGAAGC AGCAGGCGGA CTTCCCCGGC
ATGACGCGGG TAGAGGACCC GATCGAGTCG GAGGAGAACG AGGAGGTTGC CTACAACAAG
GCGAAGGAGC TCCTCTCGAA GTACCCGGAC ATCAAGGGCT TCCAGGGCTC GGCCGGCACC
GACGTCGTGG GCATCGCTCG TGCCGTCGAG GAGGCAGGGA AGACCGACAG CGTCTGCGTG
ATGGGCACCA GCATCCCGTC CGTCGCGGGT GGCTACCTCG AGAGTGGAGC CATCGACAAG
ATCTTCTTCT GGGACCCGGC TATGGCGGGC AAGGCACAGC TGGCCGTCGC GAAGATCCTC
GCTGAGGGCG GAACGATCGA GGAAGGTACC GACCTGGGGG TCCCCGGGTA CGAGTCGCTC
GTGAAGCTCG ACGGTGCTGA CAACGCGTTC GCCGGCAACG CAGGGGTCGC GGTCGACGTC
GAGAACATGG ACGAGTACGA CTTCTAG
 
Protein sequence
MRHSRLARGA IVGALALALA ACSSVDDSGS DDNPGAAGSS ASGGDGAYVV VVKATGIGWF 
DRMEVGVKEW ASETGLDARE EGPSEATNEG QVAIIQDLIA QKPTAISVVP NDLAGLESVL
GQARDAGIIV VSHEAVGIKN VDIDIEGFEN TAYGAQIMEN LGECTGGEGE YVQFVGRLTN
GSHSEWVKGA YEKQQADFPG MTRVEDPIES EENEEVAYNK AKELLSKYPD IKGFQGSAGT
DVVGIARAVE EAGKTDSVCV MGTSIPSVAG GYLESGAIDK IFFWDPAMAG KAQLAVAKIL
AEGGTIEEGT DLGVPGYESL VKLDGADNAF AGNAGVAVDV ENMDEYDF