Gene Namu_4271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4271 
Symbol 
ID8449897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4750872 
End bp4751921 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content76% 
IMG OID645043319 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003203548 
Protein GI258654392 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.189844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGCT CCCGGGCGAC CCTGGCCCAG GTCGCGGCCC GCGCCGGCGT CTCGGTCTCG 
ACCGCCTCGC TGGCCTTCAG CGGGTCCGGC CCGGTGTCGG CGGCGACCCG CGAGCGGGTG
CTCGCCGCGG CCGAGCAACT GCGCTACGCC GGTCCCGACC CGCGGGGCCG CTCCCTGCGG
CAGGGCCGCT CCGGGATCAT CGCGGTGGTC ATGGAGGACC GGGTGCTGGC CGCCTTCCGC
GATCCGGTGC GCATCGCCGT GCTCGACGGG ATCGCCCAGG AGACCTCGGC CCAGGGCCAG
GGGCTGCTGC TGCTCTCCGA CGTCGGGGAG AGCGCGGACG CCATCGGCAC CGCCACCATG
GACGCGGCCA TCCTGCTGGC CTTCAGCTAC CGCAGCGACC CCACGGTCGA ACTGCTGCGC
CGCCGGGTGG TCCCGCTGGT CGCCCTGGGC GGCCCCGACC ACGGCCTGCT GACCATCTCC
ATCGACGACG AAGCGGCCAG CGCGGCGGCG GCCGCCCACC TGGCCGGCCT GGGCCACACC
GACGTCGCGA TCGTGACGCT GCCGCTGGTC AACGTGGATT CGCCGGCGGC CCGCGGGCCA
CTCACCGCCG ACGCGATGCG CGCCGCCTCA GTGACGGTCT CGCTCACCCG GCTGCGCGGC
GCCCGGTCGG TCTTCCCGGC CGCCGCGGGC TGGGTCAGCG CGGCCAGTTC GGTCGACGAG
GGCATGATCG CCGGGCAGGC GCTGCTGGCC GATCCCGCGC GCCGGCCCAC GGCGGTGATC
GCGCAGAGCG ACCTGCTGGC CGCCGGCGTC ATCCGGGCCG CACACGAGCT CGGCCTGTCC
GTGCCGGGCG AGCTGTCGGT GATCGGGTTC GACGGGATCC CGCTGGACCG GATCATCCCG
CAGGACCTGA CCACGATGGT GCAACCGGCC GCCGCCGAGG GCCGGGCCGC CGGTCGCGCC
GTGCTGGACC TGCTGGCTGG GGAACACCCC CGGTCCACCA GCTTCCAGTG CACGTTCCAC
CCGGGCGCCA CCACCGCCCG CCCCGCCTGA
 
Protein sequence
MSGSRATLAQ VAARAGVSVS TASLAFSGSG PVSAATRERV LAAAEQLRYA GPDPRGRSLR 
QGRSGIIAVV MEDRVLAAFR DPVRIAVLDG IAQETSAQGQ GLLLLSDVGE SADAIGTATM
DAAILLAFSY RSDPTVELLR RRVVPLVALG GPDHGLLTIS IDDEAASAAA AAHLAGLGHT
DVAIVTLPLV NVDSPAARGP LTADAMRAAS VTVSLTRLRG ARSVFPAAAG WVSAASSVDE
GMIAGQALLA DPARRPTAVI AQSDLLAAGV IRAAHELGLS VPGELSVIGF DGIPLDRIIP
QDLTTMVQPA AAEGRAAGRA VLDLLAGEHP RSTSFQCTFH PGATTARPA