Gene Smal_1767 details


Gene Information

Locus tag: Smal_1767
Symbol:
ID: 6475638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stenotrophomonas maltophilia R551-3
Kingdom: Bacteria
Replicon accession: NC_011071
Strand:
Start bp: 1977762
End bp: 1978760
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 67%
IMG OID: 642730949
Product: N(4)-(beta-N-acetylglucosaminyl)-L-asparaginase
Protein accession: YP_002028154
Protein GI: 194365544
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1446] Asparaginase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
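
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that cross-check, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single untranslated stop codon. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1977762, 1978760)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (999 bp) and Protein Length (332 aa) fields.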


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.250239
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGATC GCAGGCAGTT CCTGCAGGCC GGTGCACTGG CCGCAGGCAT GGCGGCATTG 
CCGGGCGTGC AGGCACGCAC GCAGGGTGGG GCCAAGGTGG TTTCGACCTG GGACTTCGGC
GTACCGGCCA ACCAGGCCGC ATGGAAGGTA CTGGCGCAGG GCGGCAGCGC GCTGGATGCG
GTCGAAGCGG GCGCACGCTG GGCCGAGAGC GAGTTGTGCA ACCCCACCGT CGGCCATTGC
GGCAATCCGG ATCGCGACGG CGTGCTGAGC TTGGACGCGA GCATCATGGA CGGCGATGGC
CGTTGTGGTG CAGTGGCCGC GCTGGTCGAC ATCCTGCATC CGGTGTCGGT GGCCCGCAAA
GTGATGGAGA ACAGCCCGCA CGTGCTGCTG GTGGGCGAGG GCGCGCAGCA GTTCGCGGTG
CAGCAGGGTT TCGAGCGCAA GCACCTGCTG ACGCCGCAGG CTGAAGCCGC CTGGCACGAG
TGGCTGAAGA CCGAGAAGTA CCAGCCGCAG ATCAATGCCG AGCGCCGCGG TATTCCCGGC
AACAGCGACA ACCACGACAC CATCGGCATG CTGGCACTGG ATGCCAAGGG CCACCTGGCC
GGTGCCTGCA CCACCAGCGG CATGGCCTGG AAACTGCATG GCCGCGTCGG CGACAGCCCG
ATCATCGGTG CCGGCCTGTA CGTCGACAAC GACGTGGGTG CAGCCACTGC CTCGGGCGTG
GGCGAGGAGA TGATCCGCAA TGCCGCCTCG TTCCTGGTGG TCGAGCTGAT GCGCCAGGGG
CGCTCGCCGG CGCAGGCCTG CCGTGAAGCA ATTGACCGCG TGGTGCGCAA GCGCCCCGAA
GCGAGCAAGA CACTGCAGGT CTGCTTCCTG GCCATGAACA AGCAGGGTGA GGTGGGCGCT
TACGCGCTGC ATCGCGGTTT TGTCTACGCC GTGTGCGATG CGCAGCGCCA GGATGACCTG
CGTGATTCGC CGTCGATCTA CACGAGCACC CAGACGTGA
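
The gene sequence above is printed in blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to strip the block formatting and compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first line of the sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here as a short sample only.
sample = "ATGGTGGATC GCAGGCAGTT CCTGCAGGCC GGTGCACTGG CCGCAGGCAT GGCGGCATTG"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))
```

Passing the full 999-base sequence through the same two functions would give the figure to compare against the GC content field.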
 
Protein sequence
MVDRRQFLQA GALAAGMAAL PGVQARTQGG AKVVSTWDFG VPANQAAWKV LAQGGSALDA 
VEAGARWAES ELCNPTVGHC GNPDRDGVLS LDASIMDGDG RCGAVAALVD ILHPVSVARK
VMENSPHVLL VGEGAQQFAV QQGFERKHLL TPQAEAAWHE WLKTEKYQPQ INAERRGIPG
NSDNHDTIGM LALDAKGHLA GACTTSGMAW KLHGRVGDSP IIGAGLYVDN DVGAATASGV
GEEMIRNAAS FLVVELMRQG RSPAQACREA IDRVVRKRPE ASKTLQVCFL AMNKQGEVGA
YALHRGFVYA VCDAQRQDDL RDSPSIYTST QT
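
The protein sequence corresponds to the gene sequence translated codon by codon. As a rough sketch, the translation can be reproduced with a codon table built from the standard 64-codon layout in TCAG order. This assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted; that difference does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Alternative start codons are not converted to M, and codons containing
    characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence from this record.
print(translate("ATGGTGGATC GCAGGCAGTT CCTGCAGGCC"))
```

The first ten codons translate to the same ten residues that open the protein sequence above, and the final codon of the gene, TGA, is a stop codon, which is why the protein has one residue fewer than the number of codons.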