Gene Namu_3075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3075 
Symbol 
ID8448689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3391379 
End bp3392650 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content73% 
IMG OID645042157 
Producttryptophan synthase subunit beta 
Protein accessionYP_003202398 
Protein GI258653242 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000212067 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000123729 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGTGC CCGCGCGAGC GGCCGTCGAT CTGCCACGCC CGTCCGACGG GCTGGCCGGC 
ACCGACCACG ATCCCGACGA CCGCGGCTAC TTCGGCACCA TGGGCGGCCG GTGGTTGCCC
GAGGCCCTGG TCGGGGCCCT GGACGAGGTC GCCGACTACT ACCGCAAGGC CCGCCGGGAC
CCCGATTTCC TGGCCCGGCT GGACGACCTG GCGGCCAACT ACGCCGGCCG CCCGAGCCCG
CTGTCCGACG CCCCCCGGCT GACCGCCGAG GTCGGCGGCG CTCGGATCCT GCTCAAGCGC
GAGGACCTGA ACCACACCGG CAGCCACAAG ATCAACAACG TGCTCGGGCA GGCCTTGCTG
GCCCAGCGCA TGGGCAAGAC CCGGCTGATC GCCGAGACCG GGGCCGGCCA GCACGGGGTG
GCCACCGCCA CCGCCGCGGC CCTGCTCGGC CTGGAGTGCT GCATCTACAT GGGCCGGGTC
GACACCGAAC GGCAGGCCCT GAACGTGGCC CGGATGCGGC TGCTGGGCGC CGAGGTCGTC
GCCGTCGAGG CCGGCTCGGC CACCCTCAAG GACGCCATCA ACGAGGCGTT CCGGGACTGG
GTGGCCACCG TCGACCACAC CTTCTACCTG TTCGGCACGG TGGCCGGCCC GCATCCGTTC
CCGGTGATCG TCCGCGACTT CCAGCGGATC ATCGGCCTGG AGGCCCGGGC CCAGGTGCTC
GACCGCACCG GCCGGTTGCC CGACGCGGTC GCCGCCTGCG TCGGCGGCGG CTCCAACGCG
ATGGGCATCT TCCACGCCTT CCTGGACGAC CCGGACGTGC GGCTGGTCGG CCTGGAGGCC
GGCGGCGACG GCATCGAGAC CGGACGGCAC GCCTCCACCA TCAGCGGCGG CTCGGTCGGG
GTGCTGCACG GCGCCCGCTC CTTCCTGCTG CAGGACGCCG ACGGCCAGAT CATCGAGTCG
CACTCGATCA GCGCCGGACT GGACTACCCC GGCGTCGGCC CCGAGCACTC GCACCTGGCC
GAGATCGGCC GGGCCGAGTA CCGCTCGATC ACCGACACCC AGGCCATGGA TGCGTTCGCG
CTGCTGGCCC GGACCGAGGG CATCATCCCG GCCATCGAGT CCGCGCACGC CGTGGCCGGG
GCGCTGGACC TGGCCCGGGA GATCGGCCCC GAGGGCATCG TGTTGATCAA CGTCTCCGGC
CGGGGGGACA AGGACATGGA GACGGCCATG CAGTGGTTCA AGCTGGCCGA ACCGACGGGA
GCCGTCCAGT GA
 
Protein sequence
MSVPARAAVD LPRPSDGLAG TDHDPDDRGY FGTMGGRWLP EALVGALDEV ADYYRKARRD 
PDFLARLDDL AANYAGRPSP LSDAPRLTAE VGGARILLKR EDLNHTGSHK INNVLGQALL
AQRMGKTRLI AETGAGQHGV ATATAAALLG LECCIYMGRV DTERQALNVA RMRLLGAEVV
AVEAGSATLK DAINEAFRDW VATVDHTFYL FGTVAGPHPF PVIVRDFQRI IGLEARAQVL
DRTGRLPDAV AACVGGGSNA MGIFHAFLDD PDVRLVGLEA GGDGIETGRH ASTISGGSVG
VLHGARSFLL QDADGQIIES HSISAGLDYP GVGPEHSHLA EIGRAEYRSI TDTQAMDAFA
LLARTEGIIP AIESAHAVAG ALDLAREIGP EGIVLINVSG RGDKDMETAM QWFKLAEPTG
AVQ