Gene Namu_0722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0722 
Symbol 
ID8446309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp791017 
End bp792210 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content72% 
IMG OID645039857 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003200125 
Protein GI258650969 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCT TCACCGTCGT CGACGTGATC GTCGTGCTGC TGGTGGTCGC CGCGGCGATC 
GCCGGCTACC GGCAGGGCTT CATCACCGCG ATGTTCACCC TGGTCACCGC GGTGGCCGGG
GCGATCGTGG CGATCTGGCT GGCGCCGATG GTGATGGACC TGGTCCAGGA CTCGACGGCC
AAGATCGCCA TCGGCATCGC CTGCGTGATC GTCGGGGTGG GCGTCGGCGA GATCGCCGGG
GCCACGGTCG GCCGGGCGAT CTCGCGCAAG ATCAGCTGGC GGCCGGCGCA GGCCGTCGAC
CGCACGCTCG GGCTGTTCGG CAACGCGCTG GCCGTGCTGC TGGTGATCTG GCTGATCGCG
GTGCCGCTGG CCGCCGTGCC GTTCCCCTGG CTGTCCTCGG CCATCCGCGG CTCGGCCGTG
CTCGGCAAGG TCGATGAGGT CATCCCGAGC CAAGCGCAGG ACCTGTCCAT CCGGCTGCGC
GAGGTGTTCA ACGGGTCCGG GTTCCCGGCC ATCCTGGATC CGCTCGCGCC CACCCCGAAC
ACCGCGGTCG ACCCGCCCGA CCAGCAGGTG GTCGCCGCCA GCGGGGTGGC AGCCGCCGCG
AACTCGATCC TGAAGGTGCG GGCGACAGCC GAGTCGTGCG CGCGCCGGAT GGAAGGCACC
GGTTTCGTCA TCGGACCGGG CAAGGTGCTG ACCAACGCGC ACGTCGTGGC CGGCTCCGAC
CGCGCGGTGG TGGAATCGGC CGACGGCAAT CTGCGGGCCA CGGTCGTGCT GTACGACCCG
CAGACCGACC TGGCCGTGCT GGACGTGCCC GACCTGAGTG CCCCGGCCCT GCCCTGGGCG
GAGCAGCCGG CGGCGTCCGG CTCGGACGCC GTGGTCGCCG GGTTCCCGCT GGACGGCCCG
TACACACTGG TCCCGGCCCG GGTGCGGTCG GTGATCCAGC TGCGCGGGCC GAACATCTAC
TCCAACGCCA CGGTGACCCG TGAGGTCTAC ACGCTGCGGG CGCAGGTGCA GCCGGGCAAC
TCCGGGGGCC CGTTGCTGGC CCCCGACGGC AGCGTGCTCG GGGTGATCTT CGGCGCGGCG
ATCGACGAGA CCGACGTCGG GTTCGCGCTC ACCGCGGCCG AGGTGGCTCC GGTGGTGCAG
GCCGGGCTGG TCGACGACAC CGCCGCCTCG ACCCAGAGCT GTACGGCCGC CTGA
 
Protein sequence
MTGFTVVDVI VVLLVVAAAI AGYRQGFITA MFTLVTAVAG AIVAIWLAPM VMDLVQDSTA 
KIAIGIACVI VGVGVGEIAG ATVGRAISRK ISWRPAQAVD RTLGLFGNAL AVLLVIWLIA
VPLAAVPFPW LSSAIRGSAV LGKVDEVIPS QAQDLSIRLR EVFNGSGFPA ILDPLAPTPN
TAVDPPDQQV VAASGVAAAA NSILKVRATA ESCARRMEGT GFVIGPGKVL TNAHVVAGSD
RAVVESADGN LRATVVLYDP QTDLAVLDVP DLSAPALPWA EQPAASGSDA VVAGFPLDGP
YTLVPARVRS VIQLRGPNIY SNATVTREVY TLRAQVQPGN SGGPLLAPDG SVLGVIFGAA
IDETDVGFAL TAAEVAPVVQ AGLVDDTAAS TQSCTAA