Gene Namu_4071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4071 
Symbol 
ID8449691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4486378 
End bp4487550 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content70% 
IMG OID645043115 
ProductThiamin pyrophosphokinase catalytic region 
Protein accessionYP_003203350 
Protein GI258654194 
COG category[S] Function unknown 
COG ID[COG4825] Uncharacterized membrane-anchored protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.154875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.112383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCT TGTCACGTCA GTCACGCGCG TTGCCGGGTG TCACAGGAGT GGCGCGCGTG 
TCCCGTCGCA GCGAGGCGCT GCTCGATCGG GTGGGCCACA AGGACATCGT CGTCATCGAC
CAGGTCGACA TCGACCGGGC GACCGCGGAC GCCCTGGTCA AGGCGGGCGT CACCGGGGTG
ATCAACGCAG CCCAGTCCAT CTCCGGCCGG TTCCCCAACC TGGGTCCGGA GATCCTGGTC
GCCTCCGACG TCCTGCTCGT CGACGGGGTG GGCGACGGCG TGTTCGCCAA GGTCAAGGAC
GGGGCGAAGA TCCGGGTCGA CGGCGGGTCC GTCTACATCG GCGACGACGT GATCGCGCAG
GGCACCGAGC AGACTCGGGA GTCGATCGCC GACCTGCTGA TCGAGGCCAA GACCGGCATG
TCGGCCCAAC TCGAGGCCTT CGCCGCCAAC GCCATCGAGT ACATGAAGCG GGAGCGCACC
CTGCTGCTCG ACGGGGTCGG CATCCCCGAG GTGGCCACCG AGTTGGAGGG CCGGCAGGTG
CTGCTGGTCG CCGGCTCCGC GGACACCCCC AAGCAGCTCA AGTCGCTCAA GAAGTACATC
TCCGATTTCC GGCCGGTGCT GGTCGGCGTC GACGGCGGCG CCGACGCCCT GCGCGCCGCC
GGCTACAAGC CCGCGCTGAT CATCGGCAAC CCCGAGCACA TCGACTCGGA GACGCTGCGA
TGCGGGGCCG AGGTCGTCAT CCCCGCGCAC ACCGACGGGC ACGCCCCCGG CCTGGAACGG
CTGCAGGACC TCGCCGTCGG AGCGGTCACC TTCCCGGCCA CCGGCTCGTC GGAGGATCTG
GCCCTGCTGC TGGTGGACGG CCGCGGCGCG TCGATGATCG TCACCGTCGG CCTGCAGGCC
ACCCTGGTCG ACCTGCTCGA CCGGGGCCGG GGCTCGACCG CCTCGACCTT CCTGGTCCGC
ATGCGGGTGG CCAACAAGGT GGTCGAGTCG CCCGTCGTCG CGTCCCTGTA CAAGTCGCGC
ATCTCCTGGT GGGTCGTGCT GATGCTGGTG GTCGCGGCCG CCGTGGCGAT GGCGACCGCC
CTGATGGTCG CCGACGTCGG CGGCGTGTAC GCGGACCTGG TCCGCGACTG GTGGAACCAG
CTGGTCGAGT GGGTCAGGGG ACTGTTCGCA TGA
 
Protein sequence
MKLLSRQSRA LPGVTGVARV SRRSEALLDR VGHKDIVVID QVDIDRATAD ALVKAGVTGV 
INAAQSISGR FPNLGPEILV ASDVLLVDGV GDGVFAKVKD GAKIRVDGGS VYIGDDVIAQ
GTEQTRESIA DLLIEAKTGM SAQLEAFAAN AIEYMKRERT LLLDGVGIPE VATELEGRQV
LLVAGSADTP KQLKSLKKYI SDFRPVLVGV DGGADALRAA GYKPALIIGN PEHIDSETLR
CGAEVVIPAH TDGHAPGLER LQDLAVGAVT FPATGSSEDL ALLLVDGRGA SMIVTVGLQA
TLVDLLDRGR GSTASTFLVR MRVANKVVES PVVASLYKSR ISWWVVLMLV VAAAVAMATA
LMVADVGGVY ADLVRDWWNQ LVEWVRGLFA