Gene Namu_2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2649 
Symbol 
ID8448261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2895491 
End bp2897353 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content65% 
IMG OID645041743 
Producthypothetical protein 
Protein accessionYP_003201986 
Protein GI258652830 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0121345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00491611 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGCCG ACCTGAGCGA CGTCCGGATC CATACCGGCA GCGAACCTGC CAATCTCGCG 
AGATCGGTGC AGGCGACGGC GTTCACGCTT GGAAGAGATA TCTATTTCGG AGCGGGTAGC
TACGCGCCGC ACACCACGTC CGGGCGACGG TTGCTGGCCC ACGAGCTGGC CCACACCGTT
GAGCAGGGTG CTGCCGCATC CGGACCGGGT CCGGTCATCG GCCGCGCTGC CGACCCCGCG
GAGAAGCAGG CCGATCGGGT CGCGGACAAC GTTCTGCGGG TGCTGCGGCA GCAAACCCAT
AATCCCGCGG ATCCAGGTGG TGCCACGCGC GGTGACGGCA GTTCGACCTC GCCTTTGTCG
GCGCTGCGGG AGCCGTCAGC GGGCGACACC GCGCCGGGCC GGGCACCCGG TGAGGCGCCC
CGGTCTGGGC TCGTCGGTCG TCAACTGCGA CGCATGGTCG GTTTCGAGGC AGAACTGATG
GTGCCCAGCC TTGGGCCGAG CGCCAACCAA CTCAAGTACA CCAAGGATCC GGACGACGTC
ACCGATTCCA TCAAGTCGTT CCTGGACGGC GGCGTTCCCT ACGGCACCGA CATCGGCGGC
AAGACCGCGG ACGTCGACGT GCGCCTGGAC AGCGATCACG GGGGGTCGAT CGATCGCACC
CCGATCGTGA GCAAGCTCGC CGAGCTGGGC TGGATCTCCG GCAAGCCCTC CGAACCGAGG
ACCAAGATCG AATTCGTCAC CAAGGCGGTG GACGAACTTG CGCCTGGCTC CAACAAGAGG
CTCAGGACCG TCGGACTGGC TTTGAAGGGC CAGCTCACTG ACGCCCTGTC CCAAGCCAAG
AGCGGGCAGA TGAAGCAGCT CGGGGCGCCC GCGAAGGCCG GCTACATGAC CGGTGTGCCC
GTCGCCGATC TCGAATGGTG GTGGTTGATG GGCAAGGAGT CCAGCGAAAT GGACGCCATG
GTCCAGGACT ATCTGACGAA CGGGATTCAG GACGATGTTT ACCTGCAGGC GACGGTGGGG
GTCGTCCCCT CCTCCTTGAT AAAGTTCTTT GCTCAGGCAG CGCTGCCCGG CGGGAAGGTG
GAACTCGCCC CGCCCTCACA GGCACGCCAA CAGATTCTCG GCCTGGTGCA GGAGGTCACC
TCCGATCTGG AGAAGAAGTT CACGGCGGCC CCGGAGGAGC ATTGGGTCAA GAAGCTCGAC
CAGGTGTCGA AGGATGCATT CCTGGGGCTT CTGGGCCTGA TCTACAGCTA CCTGCTGGGC
GACACGTTGC ATCAGACTTC CGGGGGAACG CTCTCCACGG TCAAGAATGC CGTTCCCTTC
CTCATCAAGA TGAGCCCGTA CGGCCTGCTT GCCAGTACCG CACCGCACAT GCTCAAGGAC
AGTCCGCCGC CACGGGAATT CGTGCGCAGT ATCGGCAGCT TCTTGAAGAA GTCCAACTAC
CTGCAGGTTG CCTACTGGGT CGAGGAGGCA CGAAAGGAAG GGCCGACCGC GGTCGGCGAG
GGCAAACTCG GCGCGAAGCT CGAGGCGCGC CCGAGCTCGA CGCGCCTGGT CAAGGGCGAC
TACACCGACT TCGTCGAGCA GGTTCTCCTG GGCTCGGGAG GGGCGATCGA GGTGGTGGTA
GGGAAGGCGT TGCCGGCACC CGACAAGCCG CCCACCGACT CCGGGGGCGT CGATGTGTTC
TTCGAGCTCT ACAACCAGAG CGGGATTCCG CTGGAGTATC GCGCGATCAC CAAGCGCTAC
AAGGTCTCCG AAGTCCTGCC AGCCATCGGT GAGATCATCA GCGACGTCCG GATGGCCGGC
ATGAGTGGGC TGACCGAGGA GCAAAAGGCC AAGGTCAAGG AGGCGTACGA GAGCGATGTC
TGA
 
Protein sequence
MGADLSDVRI HTGSEPANLA RSVQATAFTL GRDIYFGAGS YAPHTTSGRR LLAHELAHTV 
EQGAAASGPG PVIGRAADPA EKQADRVADN VLRVLRQQTH NPADPGGATR GDGSSTSPLS
ALREPSAGDT APGRAPGEAP RSGLVGRQLR RMVGFEAELM VPSLGPSANQ LKYTKDPDDV
TDSIKSFLDG GVPYGTDIGG KTADVDVRLD SDHGGSIDRT PIVSKLAELG WISGKPSEPR
TKIEFVTKAV DELAPGSNKR LRTVGLALKG QLTDALSQAK SGQMKQLGAP AKAGYMTGVP
VADLEWWWLM GKESSEMDAM VQDYLTNGIQ DDVYLQATVG VVPSSLIKFF AQAALPGGKV
ELAPPSQARQ QILGLVQEVT SDLEKKFTAA PEEHWVKKLD QVSKDAFLGL LGLIYSYLLG
DTLHQTSGGT LSTVKNAVPF LIKMSPYGLL ASTAPHMLKD SPPPREFVRS IGSFLKKSNY
LQVAYWVEEA RKEGPTAVGE GKLGAKLEAR PSSTRLVKGD YTDFVEQVLL GSGGAIEVVV
GKALPAPDKP PTDSGGVDVF FELYNQSGIP LEYRAITKRY KVSEVLPAIG EIISDVRMAG
MSGLTEEQKA KVKEAYESDV