Gene Namu_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0020 
Symbol 
ID8445599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp23132 
End bp25150 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content74% 
IMG OID645039171 
Producthypothetical protein 
Protein accessionYP_003199447 
Protein GI258650291 
COG category[S] Function unknown 
COG ID[COG4425] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTGC AGCAGTGGGC GGCCAAGCTG GCCGGGCCGG GCGATCAGAC CGGCCTGCTG 
GTCGCGGCGG CCATCGCGCC GGGCACGTTC GAGCCGTCCC TGGTCCCGCG CAGCCTGCCG
GACCAGGCGA TCGTCACCGG TATCGGCACC ACGCTGTCCT ACGTCCTGAC CGTCGCCACC
CAGGACAGCA TCGAGGCGTT GGCCTCGGGC CTGGCCGGCC GGGTGGGTCG CGGCAGCGCG
CCCACCCGGC AGCGGCGGGC CGCGCTGCTG ATCGACCTGG CCGTCATCCC TGTCGGGCTG
GCCGCCGCCG GGGCGCTGCG CCGCCGGCCG CAGGAGTCCA CCCTGCGCGG GGTGGCCCGG
CAGGTCGCCT GGCGGACCGG CGTCACCGGC CTGGGCGCGA GCCTGTTCAC CCTGACCGAA
CTGGGCCTGA CCGCCGCGGA CGGCGCGCTG GGCGCCGGCG GCCGGATCGC CCGGATCCCG
CTCGGTGCGC CGGTCGGTCT CGGTGTCGGG CTGGCCCTGG AGGCCTACCG GCGGCGCGGC
CTGCCGGCCG AGCCGGTCAC CACCGACGCG CCGGCGACCG AGCCGCTGCG GGCGGCGGTG
GCCAGCGTGG GCGTGGTCGG GATCGTCTCG GTCGGCGCGC TGGCCCAACG GCTGCTGGCC
GACGGCACCG CCGCCCTGCT GGCCGGCGTG CTGCCGGGCG GCCGGGTGGT GTGGCGGCCG
ATGGGACACG TGCTGACCCT GGCCGCGGTG GCCGGGGCGG GCACCGTCCT GTGGCACCGG
GCGATCCGGT CGATCGAGGC CGGCACCAAC GTGATCGAGC CGATCATCGA CGACGAGGGC
GGCCGGCGAT GGGTCGGCGA GCACGCCAGC GGGTCGGCCA ACAGCCTCAT CCCGTGGGCC
ACGCTCGGCC GCGAGGGTCG CCGGCACGTG CTGCTCTACC CGCGGCCGCA GCCGGTCCGG
GACCTGCCGG TGAGCCTGCC CGACCTGGCC ATCAGCTCGG TGATGGGCGA GCCGGCCGCC
GCCGAACCGG CCCTGGTCTA CGTCGGGCTG GACAGTGCGC CGACCGCCCA GGCCCGGGTC
GACCTGGCCC TGGCCGAGAT GGATCGGGTC GGCGCCTGGG ACCGGTCGCT GATCATGCTC
TGCTCGCCCA CCGGCTCGGG CTACCTGAAC TACTGCGCGA CCGCCGCGGC GTCCTACCTC
ACCCGCGGCG ACCTGGCCAT CGTGACGATG CAGTACTCCA AGCGGCCGTC GCCGCTGTCG
CTGTTCAAGG TCAAGGATGC CCGCGAGCAG AACCGGTTGC TCTGGCTGGC GATCAGCGCC
CGGCTGCGGG AGCGGTCCGG TCCCCGACCC CGGGCGGTGC TCTTCGGGGA GAGCCTGGGT
GCTCACACCA GCCAGGACGT GCTGCTGCAC TGGGGCACCC TGGGACCGCA GGCCCTGGGC
ATCGATCGGG CGTTGTGGAT CGGCACGCCG TACGGCAGCG GCTGGATGCA CCAGGTGACC
GGCGAACCCC GACCCGACGT CGACCCGAAC CTGGTCGCCA TGGTGAACGA CTTCGAGCAG
CTGCAGGCGC TGCCCGAGAG CCATCGGGCC ACCCTGCGGT ATGTGATGGT CAGTCACGAC
AACGACGGCG TGACCAAGTT CGGCGCCGAC CTGCTCACCC GCCGCCCGGC CTGGCTCACC
GACGCGCGGC CGGCGGTGCA GATCGTGCCC GGGGCCAGCC CCCGCGGGAT TCCGGCCCGG
CTGCGATGGC GGCCGATCAC CTCGTTCCTG CACGGGCTGA TGGACATGAA GAACGCCCAG
AAGATGCAGG GTTACCGGGC CTGGGCCCAC GACTACCGTC CCGACATCGC CCGATTCGTC
GCGCAGGTCT ACGACCTGCC GGCCACCCCG GCGCAGCTCG CGCGGATCGA GACCGCGCTG
CAGGCCCGCG AGGAGATCCG GGACCGGCTG CTCTCCCAGC ATCTGGACGC GATGACCCCA
TCCCCCGATC GGTTCGTTCC TCCGGCGCCC GAGCGATGA
 
Protein sequence
MGLQQWAAKL AGPGDQTGLL VAAAIAPGTF EPSLVPRSLP DQAIVTGIGT TLSYVLTVAT 
QDSIEALASG LAGRVGRGSA PTRQRRAALL IDLAVIPVGL AAAGALRRRP QESTLRGVAR
QVAWRTGVTG LGASLFTLTE LGLTAADGAL GAGGRIARIP LGAPVGLGVG LALEAYRRRG
LPAEPVTTDA PATEPLRAAV ASVGVVGIVS VGALAQRLLA DGTAALLAGV LPGGRVVWRP
MGHVLTLAAV AGAGTVLWHR AIRSIEAGTN VIEPIIDDEG GRRWVGEHAS GSANSLIPWA
TLGREGRRHV LLYPRPQPVR DLPVSLPDLA ISSVMGEPAA AEPALVYVGL DSAPTAQARV
DLALAEMDRV GAWDRSLIML CSPTGSGYLN YCATAAASYL TRGDLAIVTM QYSKRPSPLS
LFKVKDAREQ NRLLWLAISA RLRERSGPRP RAVLFGESLG AHTSQDVLLH WGTLGPQALG
IDRALWIGTP YGSGWMHQVT GEPRPDVDPN LVAMVNDFEQ LQALPESHRA TLRYVMVSHD
NDGVTKFGAD LLTRRPAWLT DARPAVQIVP GASPRGIPAR LRWRPITSFL HGLMDMKNAQ
KMQGYRAWAH DYRPDIARFV AQVYDLPATP AQLARIETAL QAREEIRDRL LSQHLDAMTP
SPDRFVPPAP ER