Gene Namu_3900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3900 
Symbol 
ID8449519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4301145 
End bp4302698 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content69% 
IMG OID645042946 
Producttransposase IS4 family protein 
Protein accessionYP_003203182 
Protein GI258654026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0647808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0233728 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGGGTC GGTCGCGGGA TCAGCGTGAG TTGTTGGATG CCGAGTCGGT AGTCGGTGGG 
CTGCTCAAGC CGGGCAGCGT GTTCGCGTTT CTGGCCGCGC ACCGGCGGGA GGTGTTCCCG
GACGGCATGT TCGCGGATCT GTTTCCGTCG GGTCGGGGCC GCCCGTCAGT GCCGGCCGAC
GTGATGGCCT CGGTGATCGT GTTGCAGTCG TTGCACGGCC TGTCCGACGC GGACACGGTG
GACTCGGTGA CGTTCGATCT GCGGTGGAAG GCAGCGTGCG GGTTACCGGT GACCGCTGCG
GCGTTCCATG CCACGACATT GACGTACTGG CGGCGTCGGC TGGCCGCTTC GCAGTCGCCG
AACCGGATCT TCGACGCGGT CCGCCAGGTC GTGGACCAGA CCGGGGTGCT GGCTGGAAAG
AGCAGGCGAG CGTTGGATTC CACGATCCTG GACGACGCGG TCGCCACCCA GGACACGGTC
ACCCAGTTGA TCGCGGCGAT CCGCCGGGTC CGCCGCGAGG TACCCGGCGC CGCCGAGGTC
GTCGGCGAGC ACTGCTCGGC TCACGACTAT GACGACCCGG GCAAACCGGC GATCGCCTGG
AACGATCAGC AGGCCCGCGA GGCCCTTGTC GATGCGCTGG TCACCGACGC GCATCGGGTG
CTGGGACACC TGCCCGACCA GGAGCTCGGA CCGAAGGCGG CGGACGCGGT CGCCCTCTTG
GCGTTGGTCG CCGGGCAGGA CGTGGAACCG GTCGAGGGCT CGGACGGCAC CGACGGACGG
TGGCGGATCG CGCAGCGGGT CGCCCCGGAC CGGGTGATCT CCACCGTGGA CCCGGAGGCG
CGGCACGCCC ACAAGACCGT CCACCGGCGG CAGGACGGGT TCAAGGCACA CATCGCGGTC
GAACCCGACA CCGGTCTGGT CACCGCCTGC GCGGTGACCA TGGCCAGCGG ACGCGGCAAC
AGCGACGCCG AGGTTGGACC CACCTTGCTG GCACAGGAGA CCGAAAAGCT GCACGTGCTG
GCCGATTCGG CGTACGGATC GGGATCCGCG CGGGCCGAAC TGGACCATGC CGGGCACATC
GCGTTGATCA AGCCGTTCCC GCTGCGGTCG GCCGTGCCGG GCGGGTTCAC CCTGGACGAC
TTCACCGTCG ACCCCGAGGC CAGGACGGCC ACCTGCCCGA ACGGGGTGAC CCGGTCGATC
ACCGCGCAAT GGTCCGTCAC CTTCGGAGCG GCTTGCCGCG GCTGCCCGCT CCGGGCCCAA
TGCACGACCA GCGACGCCGG TCGATCGCTG AAGCTGACCG AGTACGAAAG CCTGCTCAGG
GCGGCCCGTC GACAAGCGGA AACCGAGGAC TTCCAACAGG TCTACCGACG GCACCGGCCG
ATGGTCGAAC GATCGATCTC CTGGCTGGTC CGCGGCAACC GCAAAGTCCG CTACCGCGGC
GTCGCCAAGA ACGACCACTG GTGGCACCAC CGCGCCGCTG CGATCAACCT CAGGCGAATG
CTCACCCTCG GGCTGACGCG GGTGAGCGGG ACGTGGACCA TTGCACCGGC CTGA
 
Protein sequence
MQGRSRDQRE LLDAESVVGG LLKPGSVFAF LAAHRREVFP DGMFADLFPS GRGRPSVPAD 
VMASVIVLQS LHGLSDADTV DSVTFDLRWK AACGLPVTAA AFHATTLTYW RRRLAASQSP
NRIFDAVRQV VDQTGVLAGK SRRALDSTIL DDAVATQDTV TQLIAAIRRV RREVPGAAEV
VGEHCSAHDY DDPGKPAIAW NDQQAREALV DALVTDAHRV LGHLPDQELG PKAADAVALL
ALVAGQDVEP VEGSDGTDGR WRIAQRVAPD RVISTVDPEA RHAHKTVHRR QDGFKAHIAV
EPDTGLVTAC AVTMASGRGN SDAEVGPTLL AQETEKLHVL ADSAYGSGSA RAELDHAGHI
ALIKPFPLRS AVPGGFTLDD FTVDPEARTA TCPNGVTRSI TAQWSVTFGA ACRGCPLRAQ
CTTSDAGRSL KLTEYESLLR AARRQAETED FQQVYRRHRP MVERSISWLV RGNRKVRYRG
VAKNDHWWHH RAAAINLRRM LTLGLTRVSG TWTIAPA