Gene Namu_4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4001 
Symbol 
ID8449620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4418210 
End bp4419409 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content75% 
IMG OID645043046 
Producthypothetical protein 
Protein accessionYP_003203282 
Protein GI258654126 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0000201324 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.340797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCAC GCCACGGCTC GGGGTCCGGC GGGGCACGCA TCGCCCCGTG GATCATCGTC 
GCCGTCGTCA GCGTCGTCGT GATCGCCGGC GCGGTGACCG CCTACGTCTT CATCACCCGG
GACAACAAGG CCGCCGCCAC CTGTACCAGC CAGGTCGTGC TCGAGGTCGT GGCCGCCCCC
GGCGCGGCGC CGGCCATCGA GGCCGCGGCC GCCGCGTTCG ACGCCACCAA TCCGGTCGCC
CGATCGGCCT GCGTCACCAC CGACGTGACC GCGGCGCCCG GGTCGCAGAC CGCCAGCGAC
CTGGCCGACG GGTGGACCGC CCAGCCGAGC CAGGGGCCGG CGCTGTGGTT CCCGGACAGC
GCCGCCGATC TGGCCACCCT GGAGACCCAG GACTCGGCGA TGACAGCCGG CCGCAACCCG
GCGCCGATGG CCGCGTCACC GGTCGTGCTG GCCGTGCGCA GCACGGACGC GGCCGCGGTG
ACCGCCGCCA ACCTGCAGTG GAAGGACCTG ATCACCGCCG CCGGACCCAC CGGATCGGTG
ACCCTGCCCG ACGGCGGCAA GCTGATCCTG GCCTTGCCCG ATCCCACCAC CAACCGCGCC
ACCAGTGACG CGCTGCAGTC GGTCCTGGCC GGGACGACGT CGGCGACCAT CGACCCGTCG
GTGGTCGCGG CGAACGCCCC CGCCCTGGCC GGGCTGGCCG CCGGTGGGCC AGCCGTCCCG
CCGGCCACCA CCCTGGACGC GTTGGCCGAC CTGCAGGCCG GCAACGCAGG TTTCGCCGCC
GTGCCGATCG TGGCGTCCGA GTTCGCCCAA CTGGCCGAGC AGAATCCCGG GTTGACCACG
GTGAGCCTGG GCGGTCCGAC CGGAGGTGAC CAGATCTTCG GCGTGCCGAT CACCGCCAGC
TGGGTCAACC CGACCATGGA CGACGCGGCC AGCGCGTTCC TGGCCTACCT GCGAGGACCG
GCCGGAGCGC AGGTGCTGAC CGACCAGGAT CTGGCCGCCG CCTCCGCCGT CTCCCTGGCC
GATGCCGGGG CGTCGGTCGA CGCGGCCCTG GCCAGCGCCA TCGGCAGCCC CGGCGCGACC
GGCGCCGCGC CCACCGCCGA CGGCACCGCG CCCGCGACGG CAACCCCGGG GCCGTCCGGT
GCCCCGACCT CGGGCGCATC CACCCCCACC ACGACCACGA CCACGACCAC GGGATCCTGA
 
Protein sequence
MTSRHGSGSG GARIAPWIIV AVVSVVVIAG AVTAYVFITR DNKAAATCTS QVVLEVVAAP 
GAAPAIEAAA AAFDATNPVA RSACVTTDVT AAPGSQTASD LADGWTAQPS QGPALWFPDS
AADLATLETQ DSAMTAGRNP APMAASPVVL AVRSTDAAAV TAANLQWKDL ITAAGPTGSV
TLPDGGKLIL ALPDPTTNRA TSDALQSVLA GTTSATIDPS VVAANAPALA GLAAGGPAVP
PATTLDALAD LQAGNAGFAA VPIVASEFAQ LAEQNPGLTT VSLGGPTGGD QIFGVPITAS
WVNPTMDDAA SAFLAYLRGP AGAQVLTDQD LAAASAVSLA DAGASVDAAL ASAIGSPGAT
GAAPTADGTA PATATPGPSG APTSGASTPT TTTTTTTGS