Gene Noca_4082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4082 
Symbol 
ID4596596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4310245 
End bp4311825 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content66% 
IMG OID639778688 
Productextracellular solute-binding protein 
Protein accessionYP_925266 
Protein GI119718301 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.484283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGCA CAGCACGTAG TCGCTCGGCA CGAGCCCGGG CCGGGCTCGT AGCGGTCGCC 
GCAGCGGCCG CCCTCCTCCT CGCCGCGTGC GGGGGAGGCT CGGGTGGCAA CCAGAGCGAC
GGGCCGGAGA CCACGAAGAT CACCGACCTG GTGGTCGACA GCGGTCAGGA GCCCGACTCC
CTGGACCCCT TCTACCGCAA CACCGCGGAG GCGCAACGCT TCTACCGCCT GGCCTACAGC
AGCATCCTGA AGTGGAACGA GGACGGTTCG CTCGCCCCTG ACCTGGCGGC GGAGCTGCCC
GAGGTGACCG ACGGCGGCAA GACGTGGACG ATCACGCTCC GCGACGGAGT CACCTTCCAT
GACGGCACAC CGCTGACGGC CGACGACGTG GTCTTCACGT TCGAGGCCGC GGCGAACCCC
GAGAACGGTG CCGTCTGGCT CTCCTCGCTG AGCTACATGG AATCCGTCAA GGCCGTCGAC
GACACCACGG TCGAGCTGAA GCTGACCGAG CCGTACGCCT ACATGGGTAG CCGGCTCGCG
ATGATACCGA TCCTGTCGGA CGAGACGCCG TACAAGACCA ACGACACCTA TGCCACAACC
GAGAACGGCA GTGGTCCGTA CGTGCTGGAG AAGCTCAACC GCGGCGACTC CATCGAGATG
GCGCGGTTCG GAGACTACTT CGGCGACCAG CCGCCCTTCG AGACCATCAC CTTCAAGGTC
GTTCCGGAGG ACGCCTCCCG GATCGCTCGC CTGCTCAACG GTGAGTCCCA CATCCTGCCG
AACGTACCCA CCGACCAGGT CGAGCTGATC AAGGACCGCG GCGCGAACGC CGCGATCGTC
GAGAAGAACG TCGTCCGCCT GTTCCTCTAC CCGTCGATGA ACCCCGACCG GCCGACCTCG
AACGTCGACT TCCGGCTTGC GATCGCCTAT GCGGCCGACC GGCAGCGGAT CGTGGACCAG
GTGTACGGCG GCGCGGGCCG TCCGAACAGC ACCTACCTGA CCTACGGATC GCTCTACCAC
GACGAGGAGG TCGGGATGAC CTTCGGTTCG ACGCCCGACA TCGAGGCCGC GAAGGAGCAC
CTCGAGGCGT CGGGCTACGA CACGAGCCGC ACCCTGAAGA TCATCGCGGT GAACAAGCCG
AGCGTGGTGA GAGCGATGAC GATCCTGCAG GCCAACCTCA AGGCGATCGG CGTGACCGCC
ACCGTCGAGT CGCAGGAGGT GGCCGGCTTC TACTCGGCGC TCATCTCCGG GGAGTACGAC
CTGATCGCCT TCGACAGCCC GGCGTCGACG TCGGCGGGCT TCGCTCCCGA CTACGTCAAC
GGTGGCCTGA ACAGCAAGGC GGCGAACAAC TTCGCGAAGT TCAACGACCC GGAGATGGAT
CGGCTCCTCG ACACGGCCAT GACGGCCCAG ACCGAGGAGG AGCAGGCCGC GGCCTGGAAG
GCGGTCCAGG AGCGTGACGT CGCGACGCAG GGCAACATCC AGCTCGTCGC GGCTCAGGTC
AGCGAGGCAT GGTCCAAGGA CCTGGTGGGC TACGAGCCCT CCGGGCTCCT GTGGCTGAAC
ACCGTGCTCG ACGTCAAGTA G
 
Protein sequence
MTSTARSRSA RARAGLVAVA AAAALLLAAC GGGSGGNQSD GPETTKITDL VVDSGQEPDS 
LDPFYRNTAE AQRFYRLAYS SILKWNEDGS LAPDLAAELP EVTDGGKTWT ITLRDGVTFH
DGTPLTADDV VFTFEAAANP ENGAVWLSSL SYMESVKAVD DTTVELKLTE PYAYMGSRLA
MIPILSDETP YKTNDTYATT ENGSGPYVLE KLNRGDSIEM ARFGDYFGDQ PPFETITFKV
VPEDASRIAR LLNGESHILP NVPTDQVELI KDRGANAAIV EKNVVRLFLY PSMNPDRPTS
NVDFRLAIAY AADRQRIVDQ VYGGAGRPNS TYLTYGSLYH DEEVGMTFGS TPDIEAAKEH
LEASGYDTSR TLKIIAVNKP SVVRAMTILQ ANLKAIGVTA TVESQEVAGF YSALISGEYD
LIAFDSPAST SAGFAPDYVN GGLNSKAANN FAKFNDPEMD RLLDTAMTAQ TEEEQAAAWK
AVQERDVATQ GNIQLVAAQV SEAWSKDLVG YEPSGLLWLN TVLDVK