Gene Namu_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1101 
Symbol 
ID8446697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1221825 
End bp1223858 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content70% 
IMG OID645040238 
Producthypothetical protein 
Protein accessionYP_003200497 
Protein GI258651341 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.682215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCGT TGTCGGCAGT GGACCTTCGA CTCGGGCTGG CCGACGTCGC CGCACTCGCT 
CTTGTGCAAC GCCCGGTCGT CTCGGTATGG CGGACCCGTA GCACCGGTAC CGACCACCCG
TTCCCCGCAC CCGTCGAGAC CCTGGACGGT CAGGAGTGGT TCGACGGTCG GCAGGTAGTC
GAGTGGCTCG AGGCCACCGG TCGAGGCAAC AACCCCGAAG CACGAGCGGA TTTGGCCGCG
TTTGCCAGCA TCGATGGCGG TTCGCCCGCC GGCGATCAGG AGGTCTTTTT CGGTCTGACC
GCACTGCTGT GCTTGACGGC GATAGTCGGA AGCCCCTTCG GTGAGACGGC CGCCGGGGAC
ATCCTGGACC TGGCTGACGA GGCCGACCCG GACAATGATC TGCTGTTCGG TGAGATCGAG
GAGCTCGGCG GACGAATCGG CGCCCTGGCC CACTACTGCG ACCTGCTGGT CGACGCGGCC
TACCACCCCG CTGCGGCGTT GGAGAAGCTG ATCGGCGAGC GATTCCGTCG CCATGTCCCC
GCACAGAGCA CAGTGGCCTT GACTGGGCCG GCCCACGAAC TCATCGGGTC CGCCGCGATC
GCACTGGCCG ACGGCGCCGG GCACGAGTCG CTGACCGTTG TCGACCCGAC CCCTGGCGGG
AGCGATCTCC TCCTCGCGGT CGCCGAGCGG GCCGGCGCGC GTGACCTGAC GGTGTTCACC
GGCCGATCAA ACGACGCGGC AGCCCGATTG GCCCGGCGCC GGCTGCGGGT ACACGGCGTT
CATCGCGCGG ACGGAGCGGG TAGCGACGCT TCCGGCGACC AGCCGGACCA GGCGATCCTG
CTCGCCCGAT ACCCGAGCCC GGGGCAGCCG GACGTGGCGG CGGCGGAGAT GCTCAACGCC
GTCACCGATC TGCTTCTCGG CACCACCCCC CAGCAGCGAT TCCTGGTCGC AGCACCAGCC
GCCGTGCTGA CCGATCGGCT CCGTGATCCC GGGGCGCGAA TGGCCAGGGA CCGGCTGATT
CGCTTCCCGA CGCTGCGCGC GGTCGTGCGG CTGCCGGCCG GGCTGGTCGC GGCTAAACCG
CGCGAGCGAC TGGCATTGAT GCTGTTCGGG CGTCGGCACG ACCCATCACC CGACGAGCCG
CAGATGCTCG TCGGTGATCT GAGCAACGCC GCGCTCAGCA ACGCGGTGAT CGACGACCTG
GTAACCGACC TCGTGGCCAG CACCTCGTCG CCCGCAATCG CACGAACACA CCCTTTCCGC
TTCCTGCGCC CGGTCATGGT GCGCCGGGTC CTGGCCACCG CCGGTTCGAT CGTCGAGGTC
GTCGGTCCCG GTCAGGCCCG GCCGGTCTCT GGGGCCGACT TGGTGCTGCG CATCACGGAG
ATTGTTGAGG GCATCGCCCG CCCGCTGGCG GCGCCAACCG TGCCGGAGAT GGCGGTGGCG
AGCCATCTCC AGAACCTCAT CCCGATCACC ACGCTCGGTG CGGCCAAGGA CCGCGGTGAC
GTTCGGATCC TGGCCGGGTT GCGGCTGGAC ACCGGGCTCG GCCCCGGCGG GGTTATCCTC
CTGGGCGAGT CGGAAGTGTG CGGCGCCGCC CGCGTGGGGG ATCGGACGGT CGATCGGCTC
GCCGTGATTG CCCGCCATCC GGCGGTCCAG TTCACGGAGC CGGGCGACGT GGTCTTCACG
AGCTCCCCGC GGCCGGGTGC GCTGGTCGAT ACCAACGGCA GCGCGGTCGT GACCTACCCC
GCGCGGATTG CCCGCATCGC CCGCCCCGAC TCCGGGCTGG CCGCCCGCCT GCTCGCCGCC
GACATCAACG CGCGGCCGGA AGGAGCGAAG GCCTGGCGCG GGTGGCCGCT GCGTCGGCTT
CCGTCCGACC AGGCGACGGT GCTCGACCAG GCCCTCGCGG AGATCGAGGA GTATCGGGTC
GACCTCGAAC GGCGCCGGCA CGACGCCGAG GATCTCGCCC GGCTGCTGGC CCGCGGTGCC
ACCGACGGCG CTGTCACCCT GACCACCACA ACCGAACCGA CGAAGGGAAC TTGA
 
Protein sequence
MSALSAVDLR LGLADVAALA LVQRPVVSVW RTRSTGTDHP FPAPVETLDG QEWFDGRQVV 
EWLEATGRGN NPEARADLAA FASIDGGSPA GDQEVFFGLT ALLCLTAIVG SPFGETAAGD
ILDLADEADP DNDLLFGEIE ELGGRIGALA HYCDLLVDAA YHPAAALEKL IGERFRRHVP
AQSTVALTGP AHELIGSAAI ALADGAGHES LTVVDPTPGG SDLLLAVAER AGARDLTVFT
GRSNDAAARL ARRRLRVHGV HRADGAGSDA SGDQPDQAIL LARYPSPGQP DVAAAEMLNA
VTDLLLGTTP QQRFLVAAPA AVLTDRLRDP GARMARDRLI RFPTLRAVVR LPAGLVAAKP
RERLALMLFG RRHDPSPDEP QMLVGDLSNA ALSNAVIDDL VTDLVASTSS PAIARTHPFR
FLRPVMVRRV LATAGSIVEV VGPGQARPVS GADLVLRITE IVEGIARPLA APTVPEMAVA
SHLQNLIPIT TLGAAKDRGD VRILAGLRLD TGLGPGGVIL LGESEVCGAA RVGDRTVDRL
AVIARHPAVQ FTEPGDVVFT SSPRPGALVD TNGSAVVTYP ARIARIARPD SGLAARLLAA
DINARPEGAK AWRGWPLRRL PSDQATVLDQ ALAEIEEYRV DLERRRHDAE DLARLLARGA
TDGAVTLTTT TEPTKGT