Gene Noca_3680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3680 
Symbol 
ID4595792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3902770 
End bp3903864 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content65% 
IMG OID639778288 
ProductABC transporter related 
Protein accessionYP_924867 
Protein GI119717902 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCTG TCGAGTTGCG TCACGTCTCC AAGATCTTCG GTTCCAAGAC GACTGTCGAT 
GACATCAGTC TCACCCTGCC CGACGGACAG CTCACCGTGT TGGTCGGGCC ATCCGGGTGC
GGAAAGACGA CCACGCTCCG CATGATCGCT GGCTTGGAAG CCGTTTCCCA CGGCTCGATC
CACTTCGACG GAGAGGACGT CACCGGTGGA GAACCGCGGA CTCGCGATGT GTCGATGGTG
TTCCAGAACT ACGCCCTCTA CCCCCATCTG ACGGTTCAGG ACAATCTTGC CTTTCCGGTT
CTTGCTCGCG GCGGCAAGCG CGCCGATGCC ATTCGACGAG CACGTGAGGC GGCTGAGATG
CTCGGGCTCA CCGAACTGCT GCAGCGCAAG CCTGGGCAAC TCTCGGGGGG ACAGCAGCAG
CGGGTGGCAA TCGGACGTGC CGTCGTGCGA GAACCGCGGG TGTTCCTGTT CGACGAGCCG
CTGTCCAACC TGGATGCGCG GTTGCGGGTG GAGATGCGCT CGGAGATCCT CCGGCTGCAG
CGTCAGCTTG GTGTCACGGC CGTCTATGTC ACCCACGACC AGGAGGAGGC GATGACCATG
TCCGACAGCA TGGTCGTCAT GGACGGCGGC ACCATCGCTC AGCAGGGCAG CCCGCGGGAG
GTCTACGCCG CTCCAGCCAC CACTTTCGTC GCCGGATTCG TCGGATCGCC CCGCATGAAC
CTGATCGCCG GTCGGGTCGT CGGTGGGGTC TTCGAGTCTC GGTGGGGTCG AGTGCCGATG
GGTGCCGCCG ACCAGGAAGG CAGCTTGGGT GTACGCCCCG AGCTCGTTCG TCTGGTCGGG
GCTGACCACA ACGAGTCGAG CCGGGCAAGG AATGATCCTG GCGCCGGTGC GGGCGCTGCG
GCCCGAGTCG AGCTGGTCGA GCTTCTAGGT CCGCGAGCCA TCGTCTCGCT CAACGCCGAT
GGCGAGCGGC TCATTGCAGT CGTGGAGGCT CGCGACCTGT CGGGCATCCA TGAGGGCAGC
CTGGTCGACG TGGACTTCGC GTCTGCGGGC CTGCACTTCT TCGAAGCCGG CGGACAGCGG
CTGTTGACGA CGTGA
 
Protein sequence
MAAVELRHVS KIFGSKTTVD DISLTLPDGQ LTVLVGPSGC GKTTTLRMIA GLEAVSHGSI 
HFDGEDVTGG EPRTRDVSMV FQNYALYPHL TVQDNLAFPV LARGGKRADA IRRAREAAEM
LGLTELLQRK PGQLSGGQQQ RVAIGRAVVR EPRVFLFDEP LSNLDARLRV EMRSEILRLQ
RQLGVTAVYV THDQEEAMTM SDSMVVMDGG TIAQQGSPRE VYAAPATTFV AGFVGSPRMN
LIAGRVVGGV FESRWGRVPM GAADQEGSLG VRPELVRLVG ADHNESSRAR NDPGAGAGAA
ARVELVELLG PRAIVSLNAD GERLIAVVEA RDLSGIHEGS LVDVDFASAG LHFFEAGGQR
LLTT