Gene Namu_4224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4224 
Symbol 
ID8449850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4675435 
End bp4676874 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content72% 
IMG OID645043273 
Productmembrane-flanked domain protein 
Protein accessionYP_003203502 
Protein GI258654346 
COG category[S] Function unknown 
COG ID[COG3402] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00512848 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.19877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTACC CGGACAATCT GCTGGTCGCC GGCGAGAAGA TCTATGTGCG CAAGCGCCCG 
CACTGGAAGG TGCTGATCCT CCCGACCCTG TTCTTCGTCC TCATCGTCGG TGGTGGGGCG
GCGATCATCG CCTTTGCCAA CAGCCGGGGC TGGGACTGGC CGAGCTGGGC GGACTGGGCG
ATCGTCGCGA TCGCGGTGAT CGCGTTGATC ATCCTGGTGG TGGTGCCGTT CATCCGCTGG
CGCACCGAGC ACTTCGTCAT CAGCAACCAC CACATCTTCT TCCGCACCGG CCTGCTGTCC
CGGCGCGAGC ACCAGATCCC GCTGGGCCAG ATCGCCAACA TCGAAACCGA GGTCACCTTC
TGGGGCCGGT TGATGGGGTA CGGCTCGCTG ATCGTCGAGT CCTCGGCCGA CCAGCCGCTC
AAGTTCCGCA ACGTGGCGAA CCTGTCCAAG GTGCAGTCGA TGCTCAACCA GTTGATCCGC
AACGAGAAGG AACTCACCCA CCGCGGCATG GTGGAGGTCG ACGACTACGC CGCCGACGAC
TCCGACCGGG AACGCGCGGG CGACCGCTCG AACCAGGCGC CGACCGGCCA GTGGGCGGCC
GCGACCGACT CGAACCAGAC CGGCGGCTAC GGACCGGCCA CCGGGGCGCA GCCGGCGTAC
GGGCAGGGGT ACCCGCCGGC CCAGGGGTAC CCACCGGCGG CGGGTTACGC GCAGCCCGGG
TACGGGCCGG CCCACCCGCA GACCGGTCCG ACCCAGGGCT ACCCGCCGCC CGGTCAGCCG
CAGGGTTACC CGCCGCCGGG CCCGACCCAG AGCTATCCGG CCGGCCACCA GCCCACCCAG
TCGCCGGGGT ATCCGCCGTC CGGGTATGCG ACGGGCGGGT ATCCGCCGAA CGGCTACCAG
CAGCCCGGGT ATGCCCCGAA CGGGTACCAA CAGCCCGGGT ACCAGCAGCC CGGACACGCG
TCGGGCGGCT ACCCGCCGCC GGCCGCACCC GCGAACTACT CGCAGCCCGG CCCGCCGCAG
CCCGGGTATT CGCAGTCCGG CTATTCCCAG GCCGGATATG CCCAGCCCGG TTCGTCGCAG
CCGGCCGAGG GCCAGGCCGC CCCGCCGTTC ACGCCGGCCG CCGGGCCGGG TGAGCAGCCG
CCGGCGTCGG CCGAGCCGAC AGTCATGACC CGGCTGCCGT CCGCCGCCCC GGTGTCGTCG
AGCGACTCGG GGGCCCCGGC AACCCCGCCT ACCCCGCCGA CTTCGGACCG GGGCCGGCAC
GCCCGGCCCA CCGAGCAGAC GCCGCTGCCG AATTTCGCCG AGCCCGACCC AACGGTGATC
GTTCGCTCGC CCGCGTCGTC GGGTGATTCG GCGTCGGGTG CACCGGCGGG TGCACCGGCA
AGCGCGCCGG CGCCGTCGGA GACCGACCCG GACCAGCCGG ACTCGAGCCA GCGATCCTGA
 
Protein sequence
MPYPDNLLVA GEKIYVRKRP HWKVLILPTL FFVLIVGGGA AIIAFANSRG WDWPSWADWA 
IVAIAVIALI ILVVVPFIRW RTEHFVISNH HIFFRTGLLS RREHQIPLGQ IANIETEVTF
WGRLMGYGSL IVESSADQPL KFRNVANLSK VQSMLNQLIR NEKELTHRGM VEVDDYAADD
SDRERAGDRS NQAPTGQWAA ATDSNQTGGY GPATGAQPAY GQGYPPAQGY PPAAGYAQPG
YGPAHPQTGP TQGYPPPGQP QGYPPPGPTQ SYPAGHQPTQ SPGYPPSGYA TGGYPPNGYQ
QPGYAPNGYQ QPGYQQPGHA SGGYPPPAAP ANYSQPGPPQ PGYSQSGYSQ AGYAQPGSSQ
PAEGQAAPPF TPAAGPGEQP PASAEPTVMT RLPSAAPVSS SDSGAPATPP TPPTSDRGRH
ARPTEQTPLP NFAEPDPTVI VRSPASSGDS ASGAPAGAPA SAPAPSETDP DQPDSSQRS