Gene Namu_5245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5245 
Symbol 
ID8450876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5851644 
End bp5853731 
Gene Length2088 bp 
Protein Length695 aa 
Translation table11 
GC content74% 
IMG OID645044276 
Productprotein of unknown function DUF477 
Protein accessionYP_003204500 
Protein GI258655344 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG GACGAGGACT GCACCGGGCC GCGGCGGGGC TCGCGGTCGT CACGATCATT 
GGGTTGACCG CACCACCCGC CAGCGCCGAA CCTCCGTTCC GGCTGCCCAA CCAGATCACC
GACCAGGTCG GGGCCCTCAC CGGGTCCGAC CGCACCGACG TGCAGACGGC CCTGGACCAG
CTCTCGGCCG AGGAGAACAT CGACCTGTAC GTGGTCTACG TCGATACCTT CGACGAGCCG
AGCGCGGCCG TCGACTGGGC CGCCCAGACC TGGCAGACCT CCGACCTGGG CGCCAATCAG
ATGCTGCTGG CCGTGGCCAC CGGTGGTCGG GCCTATGCGG TGCACGTGCC GAACAACTTC
AAGATCTCCG ACGCCCAGCT GCAGCAGGTC GCGACCACGC AGATCCAGCC CGAGCTGCGC
AACGACGACT GGGCCGGCGC GGCCATCGCC GCCGCCAACG GCTACCGGGA CGCACTGGGC
GGCGGCTCCT CGACCGTCTG GTGGTGGATC GCCGGCGCCA TCGTGGTCGT CGGGGCCGGC
GGGTACCTGA TCTACCGGCG CCGAGCCAAG GCCGGCGCCG GCTCCGGGCC AGCCGGTCCG
GCGGGTGCCC CGGGGCAGCC CGCGGAGCCG CTCGAGCCGT TGGAGGCCCT GTCGGCGCGC
AGCGTGCAGG TCCTCATCGA CACCGACAAT GCCGTGCGGG CCAGCGAATT CGAGCTCAGC
GCAGCCGAGA GCGACTTCGG CCACGACGCC GTCGCGCAGT TTCGGGTCGC GTTCGACTCG
GCCCGCGAGT CGCTCACTCA GGCCTTCGAA ATCCGGCAGA AGGTTGACGA CGACCAGCCC
GAGGACGACG CCACCAAACG CGCCATGATG AACGACATCA TCGACCGGTG CGCCCAGGCC
TCGGCGACGC TGGACGCGCA GAGCGATCGC TTCGACGAGC TGCGGGGGCT GCGATCCCGG
CTGCCGCAGG TGCTGGCCGA GCTGCCCGGC ACGATCGACT CCCTGCAGGC GCGGATGCCG
GCTGCCGCAT CGACCCTGCA GCGGCTGCAG CAGCAGTTCT CGCCGACCGC ACTGGCCACC
GTGGCGGCCA ACGTCGAGCA GGCCGGTGAG CGGTTGCAGT TCGCCCGGGT CAGCCTGGAC
CAGGCGCGCC AACAGGCGGC CGGATCGACC CCGGCCACCA GCACCCTGCC GCTGCCCGGT
CAGCCGCCGG CGACGGCCAC GCCCCCGGCG GCCGCGGTGT TGGCCGCCGG TGCGGCTCAG
GAGGCGGCCG ACCAGGCCCG GACCCTGCTG GACGCCATCG ACCGGATGGC CGCCGATCTG
GCCACCGCGA CCACGCAGCT GACCGGCGCG ATCAGCGCGG TCGATCAGGA GCTGGCCGCG
GTCCGGGCGG CGCTCGATTC CGCGACCGCC GGGGCCAACG AGGCCTCGAT CCGGGCTCAG
CTCGACCAGA TTCAGGCCAT CCTGTCGGTC GCCCGTTCCC CCCAAGGCGC GGCCGACCCG
ATGACCGCCC TGCACAAGGT CGAAGAGGCC GACCTCGCTC TGGACGGCAT CCTGGCCAGC
ACCCGCAGCG CCCAGCAACA GGAGCAGCGC AGCCAGGCGG CGCTGGGCCA GGCGCTGCCG
ACCGCCCGGG CCGAGGTGGC GGCGGCCGAG GACTTCGTGA ACACCCGCCG CGGCGCGGTC
GGGAGCCAGG CCCGCACCCG GCTGGCCGAG GCGAAGCGGC ACCTCGCCAA TGCCGAAGCC
GGCACCGGCG GCGCCGCGGC CGCGGCGTCC GAGGCCCAGC AAGCGGCCGC CCTGGCCCGG
GAGGCCGCCG ATCTGGCCCA GCGAGACGTG AACGGCTTCG GGGGTGGCGG TTTCGGCGGT
GGGCAGCGCG GCGGCAACAG CGGGCTGGCC GGCGCCGTCC TTGGCGGCAT CGTGCTGGAC
GCCGTGCTCA ACTCGGGCCG ACGCGGTCGT GGGGGCGGCG GCTGGGGCGG GGGCTTCGGT
GGCGGCGGCT ACCGTGGTGG CGGCGGTGGT TTCGGCGGTG GCGGCGGTGG TTTCGGCGGC
GGCGGTGGCG CCGGGTCCGG GCACAGTGGG GGCAGCGGCC GCTTCTGA
 
Protein sequence
MRIGRGLHRA AAGLAVVTII GLTAPPASAE PPFRLPNQIT DQVGALTGSD RTDVQTALDQ 
LSAEENIDLY VVYVDTFDEP SAAVDWAAQT WQTSDLGANQ MLLAVATGGR AYAVHVPNNF
KISDAQLQQV ATTQIQPELR NDDWAGAAIA AANGYRDALG GGSSTVWWWI AGAIVVVGAG
GYLIYRRRAK AGAGSGPAGP AGAPGQPAEP LEPLEALSAR SVQVLIDTDN AVRASEFELS
AAESDFGHDA VAQFRVAFDS ARESLTQAFE IRQKVDDDQP EDDATKRAMM NDIIDRCAQA
SATLDAQSDR FDELRGLRSR LPQVLAELPG TIDSLQARMP AAASTLQRLQ QQFSPTALAT
VAANVEQAGE RLQFARVSLD QARQQAAGST PATSTLPLPG QPPATATPPA AAVLAAGAAQ
EAADQARTLL DAIDRMAADL ATATTQLTGA ISAVDQELAA VRAALDSATA GANEASIRAQ
LDQIQAILSV ARSPQGAADP MTALHKVEEA DLALDGILAS TRSAQQQEQR SQAALGQALP
TARAEVAAAE DFVNTRRGAV GSQARTRLAE AKRHLANAEA GTGGAAAAAS EAQQAAALAR
EAADLAQRDV NGFGGGGFGG GQRGGNSGLA GAVLGGIVLD AVLNSGRRGR GGGGWGGGFG
GGGYRGGGGG FGGGGGGFGG GGGAGSGHSG GSGRF