Gene Noca_4632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4632 
Symbol 
ID4596088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4912744 
End bp4914003 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content69% 
IMG OID639779241 
Productextracellular solute-binding protein 
Protein accessionYP_925814 
Protein GI119718849 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.448727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAACA ACCGCAGGCG TCTGACGGCG ATCGCCGTCG CCGGCGTCGC CTCCCTGGCC 
CTGGGCGCCT GCTCCCAGGG CTCGGCGACT TCGAAGGACG ACGGGGCCGA CGGCCAGACC
ACGATCACCT ACATGGAGTT CTCCTCCAAC GGGGGGCACG AGAAGGACCT GGCCGCGATC
GTGGACGCGT TCGAGGCCGA CCACCCCGAC ATCAAGGTCG AGGTGGAGAC CACGCCGTAC
GACGCGTACT TCACCAAGCT CCAGACAGCA CTCGCCGGCG GCACCGCCGG GGACGCCTTC
GAGCTCAACT ACGAGAACTT CGTGACGTAC GCCGAGAACG GCTCGCTCGC CCAGCTCGGG
TCCTTCGACG AGGCGGCCTA CAAGCCGTCG CTGCTCGACG CGTTCGCGCA GGACGGCGCC
CAGTACGCGT TGCCCGAGTC CTTCTCCGAC GTGGTGCTCT TCTACAACAA GGAGCTCTTC
GACAAGGCCG GCCTGGAGAC GCCCACCTCG GACTGGACCT GGGCGGACGA GCGCGCCGCC
GCCGAGAAGC TGACCGACAA GGACGCCGGG ATCTGGGGCG ACTACCAGCC GGTGCAGTTC
TTCGAGTTCT ACAAGGCGCT CGCCCAGTCC GGTGGGTCGT TCTTCAGCGA GGACGGCTCG
GAGGCGACGT TCGACTCCCC CGAGGGCATC GAGGCGGCCG AGTGGCTGGT GAGCAAGCCG
GGCAGGACCA TGCCGACCGA GGCCGAGGGC GCCGGCACAC CGGACTTCGA CACGAACCTG
TTCAAGGACG GCAAGCTCGC GATGTGGCAC AGCGGCATCT GGATGTTCGC CGGCCTGGCC
GACGTGCCGT TCGAGTGGGA CATCGCCGTC GAGCCGGGCA ACACCCAGCA GGCGTCGGCC
ATGTTCGCCA ACGGGGTCGC CGTCAACGCG GCGAGCGAGA ACAAGGCGGC TGCCGAGGAA
TGGCTGTCCT ACCTGACCTC GTCCGAGGTC ACGGCGGACA CCCGCCTGAG CACCTCGTGG
GAGCTGCCGC CGGTGGCGGA CGAGTCCCTG CTGGCGCCGT ACCTCGACCA GGACAAGCCC
GCCAACCGGG CCGCTGTGAT GGAGTCCCTG GAGTCCGTGG CGCTGCCGCC GGTCATCGCT
CGGCAGGCCG AGATGCAGGA CGCGATCACC CAGGAGCTCG GCGAGGCGGC CGCAGGCCGC
AAGAGCGTGA AGGACGCGCT TGCGGACGCC AAGAAGGCCG TGGACGCCCT GCTCGGCTGA
 
Protein sequence
MKNNRRRLTA IAVAGVASLA LGACSQGSAT SKDDGADGQT TITYMEFSSN GGHEKDLAAI 
VDAFEADHPD IKVEVETTPY DAYFTKLQTA LAGGTAGDAF ELNYENFVTY AENGSLAQLG
SFDEAAYKPS LLDAFAQDGA QYALPESFSD VVLFYNKELF DKAGLETPTS DWTWADERAA
AEKLTDKDAG IWGDYQPVQF FEFYKALAQS GGSFFSEDGS EATFDSPEGI EAAEWLVSKP
GRTMPTEAEG AGTPDFDTNL FKDGKLAMWH SGIWMFAGLA DVPFEWDIAV EPGNTQQASA
MFANGVAVNA ASENKAAAEE WLSYLTSSEV TADTRLSTSW ELPPVADESL LAPYLDQDKP
ANRAAVMESL ESVALPPVIA RQAEMQDAIT QELGEAAAGR KSVKDALADA KKAVDALLG