Gene Noca_3017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3017 
Symbol 
ID4596464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3213538 
End bp3214818 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID639777622 
Productextracellular ligand-binding receptor 
Protein accessionYP_924206 
Protein GI119717241 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGCT CCAGCCGCGC TCGCCGCGCA GCCATCACGC TCGCGGCATC CGCACTCGTC 
CTCACTGCTT GTGGCAGTGA CGGCGGCTCC GACAGCAAGT CCGACGCTCC CGACGAGGGC
ACCTCCTCGT CGGCGCCGGC AACGACGGGT GACGGCGTGC TCAAGATCGG CCAGCTGCTG
CCCCAGACCG GTGACCTCGC CTACCTGGGC CCCCCCGAGT TCGCGGGCGT CGACCTGGCC
ATCAAGGAAA TCAACGACGC GGGTGGCGTG CTCGGGAAGC CCGTCGAGAG CTTCAAGGCG
GACTCCGGCG ACGGTACGCC GGATATCGCG GGCGCCTCCG TCGACTCGCT CCTCCAGGAC
AGCGTCGACG CCATCGTCGG TGCCGCCGCC TCCGGCGTGT CGCTCTCGGT GATCGACAAG
ATCACCGGTG CCGGCGTCGT GCAGATCTCC CCGGCGAACA CCGCCGCGGC GTTCGACACC
TACGACGACG GCGGCCTGTA CTTCCGCACC GCGCCGTCCG ACCGCCTGCA GGGCCAGGTG
CTCGGCAACA TGGCGGTCGA GGACGGGTTC TCCAACGTCG CCGTGATGGC GCGCCAGGAT
GCGTACGGCG AGGGTCTGGC CGAGCAGGTC GACCAGACGC TGAAGGAGCA GGGCGCCAAC
GTCGCGGCCC ACATCCTCTA CGCCGCCGAC GCTCAGAACT TCACCGCAGA GGTCAACGAG
ATCGCGGCGG CCAAGCCCGA CGCTCTCGTG CTGATCGCGT TCAACGAGAC GACGAAGATC
ATCCCGCAGC TGATCGCCAA GGGGATCGGC CCGCAGGACA TCCAGCTCTA CTTCGTCGAC
GGCAACATGG CCGACTACTC CGCCGAGTCC TTCGACCTGG AGGGCGTCAA GGGCACCTTC
CCGGCTCCGG CCGAGGTCGA CGAGAGCTTC AACCAGCGGC TGCTCGAGGT GGACCCGAAG
CTGAAGGACT TCACCTACGG CCCGCAGTCC TACGACGCCA CGATCCTCAC CGCGCTCGCT
GCGATCGCGG CCGGGGACGA CTCCGGCGAG GCGATCGCCG GCGAGCTGGT CAACGTCTCC
AAGGACGGCG AGGCCTGCAC CACGTTCGCC GACTGCGCGA AGCTGCTCGA GGACGGCCAG
GACATCAACT ACGAGGGTGT CTCCGGCCCG ACCGACATGA ACGACACCGG CAGCCCGAAC
GCTGCGACGA TCGGCATCCA GGAGTACGCC AAGAACAACA AGTACTCGCA GATCGACTCG
GTCTCCGGCG TCCTGGAGTG A
 
Protein sequence
MIRSSRARRA AITLAASALV LTACGSDGGS DSKSDAPDEG TSSSAPATTG DGVLKIGQLL 
PQTGDLAYLG PPEFAGVDLA IKEINDAGGV LGKPVESFKA DSGDGTPDIA GASVDSLLQD
SVDAIVGAAA SGVSLSVIDK ITGAGVVQIS PANTAAAFDT YDDGGLYFRT APSDRLQGQV
LGNMAVEDGF SNVAVMARQD AYGEGLAEQV DQTLKEQGAN VAAHILYAAD AQNFTAEVNE
IAAAKPDALV LIAFNETTKI IPQLIAKGIG PQDIQLYFVD GNMADYSAES FDLEGVKGTF
PAPAEVDESF NQRLLEVDPK LKDFTYGPQS YDATILTALA AIAAGDDSGE AIAGELVNVS
KDGEACTTFA DCAKLLEDGQ DINYEGVSGP TDMNDTGSPN AATIGIQEYA KNNKYSQIDS
VSGVLE