Gene Noca_0806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0806 
Symbol 
ID4598888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp849369 
End bp850565 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content66% 
IMG OID639775409 
ProductABC-type sugar transport system periplasmic component-like 
Protein accessionYP_922018 
Protein GI119715053 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGCA AGAAGTGGCT CGCGTCGACA GCCGTCACCC TCGTCGTCTC CCTCGCGCTC 
GTGGCGTGCG GCTCGGGCGA TTCGGAGGAC TCGAGCACCG GCTCGGGGAG CAACGGTGAC
TCGGCGGCGC TCGTCGCCTC GGCTGAGCAG TTGGTCGAAC GCGCCCAGCA GGGCCTGACC
TACTCGTCGA CAGACACCCC CGACACTCCT GACCAGATCG AGACCTACGG CGACTGGCGC
GGCCCGACGG ACGCACCCGA GATCAAGAAG GGTGCCAACG TCCAGGTGAT CGTGTGCACC
AAGCAGGCCG TCGCCTGCAT GGAGGGCGCT CAGGGCGTTG TCGACGCGGG GAAGGCCCTG
GGCTGGAAGG TCGAGGTCAT CGACGGCGGC GGAACTCCCG AAGGCAATGC CAAGGCCTTC
GACACCGCCT TCAGTCGCAA CCCCGACGCC ATCGTGGGTA TCGCCGTTCC TGCGCTGGCC
GTTGGAGACA AGCTCGAGGA GGCCAGGGCG CGCGGGATCA TCACCTCGGT GACCGGGGAC
ATCCCGCCGA CCGGGGGGGA GACCGCCTAC GACTCGATCG TGTCCTTCCG GATGACGCTG
ATGATGTCGA CGATCGGCTT CGCCGAGATC GCTCGCACCG ACGGCAAGGC CAACTCGATC
GTCGTGACGG ACAGCGGCAC GCCGTCTCTT GTCGAGGCAA TGGGCCAGTA CAAGAAGGTT
CTCGACACCT GCAGCGGCTG CGACTACACC GACGTCTCGT GGACCATCTC GGACGCGGTC
GACCCGACCA AGGTGACCTC GATCGTCACG GGTGCGATCT CGCAGAACCC CAACGCGACC
TCGATCGTCG TTCCGTACGC GATCGGCCTG CCGGCGGTCA TCCAGGCGGT CGCCTCGGCC
GGGAAGGCCG ACCAGATCAA GGTGATCACC AAGGACGCCG ACGCCGTGGG CCTGCAAGCC
GTCGCCGAGG ACCAGGTCAT CTATAACAGC GGGGCTTCGG CGACGTGGGC CGGCTGGGCC
GTGGCAGACC AGCTGGTCAG GGGGCTGGCC GGCGCGCCCT ACCTCGATGT CGACGAGATC
GGACTCGGGC TGATGTTCTT CATCAAGGAC AACGTGCCGA GCGACGGCAA GATCGACAGC
GCCGAGGGAA TGATCGACTA CGCCTCCTTC TACAAGAAGG CGTGGGGTCT GAGCTGA
 
Protein sequence
MRSKKWLAST AVTLVVSLAL VACGSGDSED SSTGSGSNGD SAALVASAEQ LVERAQQGLT 
YSSTDTPDTP DQIETYGDWR GPTDAPEIKK GANVQVIVCT KQAVACMEGA QGVVDAGKAL
GWKVEVIDGG GTPEGNAKAF DTAFSRNPDA IVGIAVPALA VGDKLEEARA RGIITSVTGD
IPPTGGETAY DSIVSFRMTL MMSTIGFAEI ARTDGKANSI VVTDSGTPSL VEAMGQYKKV
LDTCSGCDYT DVSWTISDAV DPTKVTSIVT GAISQNPNAT SIVVPYAIGL PAVIQAVASA
GKADQIKVIT KDADAVGLQA VAEDQVIYNS GASATWAGWA VADQLVRGLA GAPYLDVDEI
GLGLMFFIKD NVPSDGKIDS AEGMIDYASF YKKAWGLS