Gene Namu_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2068 
Symbol 
ID8447678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2279877 
End bp2282063 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content70% 
IMG OID645041192 
Producthypothetical protein 
Protein accessionYP_003201437 
Protein GI258652281 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000291289 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0152831 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGGC AGGTGACTTG GCGCGAGCTC AGATTCCCCG GCCCAATCCC ACCCGACGCC 
GTGCGCGCCT GTCTGGTCGG CTGCGGCGGC CTGCCCGGTA ACCCGCTGCT CATCCTCGAA
GCGGTCGCCA CCGGCGGCCG GCTGCGCTGG CGCATCGGTG CCCCGTCAAG CCTGAGCGTC
CGGCAAGCCT GCCAACAGGT GACCCTGCAT CTGCCGGGCA CCATCGTCAG CCCGGTCGAG
GGCACGTGGC CACCGGTCGA GACCGCCCAA ATCGGGGCGC AGGTACGGAT CAGCAGGACC
AGGCTGCTGC CGCTGGGTGA CCTGGATCCG GATGCGGTGA CTCGCGGGCT GCTCGGGGCG
CTGGCTTTGG CCGGCCAAGG TGAATCCCTG CACTACCAGC TGCTACTCGG TGCCCGTATC
CGCCCAACTC GGCCACCCGA AGCAGCCAAA CCGCTGCCTC GCGGTGCCAC CGAGCGGCTC
GAGCAACCGG GCTTCGGCTG CCTGGTGCGC CTGGCCGCCA CCGCCGCGCA CGATGGACGC
GCCCGCATCT TGGTGCGACA GGTTGCCGCG AGCTTGAAAG GCCTGGAAGC CCCTGGGGTA
GCGGTCACCG TCCGCCGGTG CGCAGTGACG GCACTGGCTC GGGTATCGCT GCCGCTCTGG
TGGCCATTGT GGATCTCCGC CAGCGATCTG GTGCCACTGA CGGCCTGGCC GGTGGCTGAC
GACCCGAAGG CCGAGCTGGC CGGCTTGCCG CCGCGGCACC CGAAGCTGCT GCCGCCCACG
GCCGCCCACC CCACGGAGGG GCTCGACATC GGCAGGGCCG TTGCCGTCAG TATGGAGCGG
CGTATCGCGC AACCGGTCGG CGACACTCTG CAGCACCGGC ACGTACTGGG GCCGTCCGGG
GTGGGCAACT CGACGTGGGT CGGCTATGCG GCCCTGCAAG ACATCCTGGC CGGCCGGTGT
GTCGTGGTCA TCGACCCGAA GCGCGATCTG GTCACCGAAC TGCTGGCCCG TATCCCGGCC
GACCGGTTGG ACGACGTCGT GGTGCTCGAC CCGGCCAGCC GCACCCCGCT GGGGGTCAAC
CCCTTGGCCG GTGCCGACCC TGACCTGGCA GCGGACTCGA TCGTCGCCGT CTTCCACTCC
CTGTACGGCG ACGGGCTCGG ACCCCGCTCG ACCGACATCC TCCATGCCGC CGTGTTGACA
CTGGCCCGGC GGGGTGATGC CTCGCTGGCG ATGGTGCCGC TGCTACTGAC CAACCCCGGT
TTCCGCCGTT CGCTGGTCGG CCGTGCCCAT AAGGAAGACC CGCTCGGTGT CGGCAGCTTC
TGGGCCTGGT TCGACAGCCT GAGCAGTGGC GAACAGGACA TGGTCACCCG GCCCCTGCTC
AACAAACTGC GGCCGATCAT GCTGCGGCCA TCCCTGCGCG GCGTGTTCGG ACAACGGCGT
CCGGCGATCT CCGTCGAGCA GATCCTGGCC GAGCGAAAAA TCCTGCTGGT CGAACTGCGC
AAGGACCAGA TCGGTCCCGA GGCCGCCGGG CTTATCGGGT CACACGTGGT CGCCCACCTG
TGGCGGGCCA TCCTCGGTCG GCTGAGTCTG CCGGCCGAGC AGCGCCACCC GTTCATGGTC
TACATCGACG AAGTCCAGGA CTACCTGCGG CTGCCGGGCA GCCTGGCCGA CGCGCTGGCC
CAGGCCCGCG GCCTCGGGGT ATCCGTCACG GCAGCCCACC AGCACCTGGG ACAGCTCGGC
CGACTCGACG CCGATCTGGA AGCCAACACC GCCACCAAGC TGTTCTTCCG GCTCAGTCCG
GGCGACGGCA GACACCTGGC CAGCGCCGTC GGTGCCGGGC AGCTGACCGC CGACGACTTC
ACCCTCCAAC CCGACCACCA GCTCTACGCC CGGCTGCTGG TCAACGGCCG ACTGCTGCCC
TGGGTCTCCG TCGCCACCCA GCCTCTGCCG CCACCACTGC ATGACCCGGC CCAGGTCCAA
GCCCGCAGCG AAGCCCACTA CGGCCGCGCC CTTGACGACG TCGAAGCCGA CCTGCTCAGC
CTGGCCGACG CCACGGCCCG AGAGGGCAGC ACCGAGCACC CGGACGAAGC CACACCACGC
ACCGGCTTCG GCCGCCGCCG GCCCAGCGCA CCATCCCGGC CGTCCGCCGA ACCCCACCCC
GTTACCCAGA AAAGAGGTCG GATATGA
 
Protein sequence
MSRQVTWREL RFPGPIPPDA VRACLVGCGG LPGNPLLILE AVATGGRLRW RIGAPSSLSV 
RQACQQVTLH LPGTIVSPVE GTWPPVETAQ IGAQVRISRT RLLPLGDLDP DAVTRGLLGA
LALAGQGESL HYQLLLGARI RPTRPPEAAK PLPRGATERL EQPGFGCLVR LAATAAHDGR
ARILVRQVAA SLKGLEAPGV AVTVRRCAVT ALARVSLPLW WPLWISASDL VPLTAWPVAD
DPKAELAGLP PRHPKLLPPT AAHPTEGLDI GRAVAVSMER RIAQPVGDTL QHRHVLGPSG
VGNSTWVGYA ALQDILAGRC VVVIDPKRDL VTELLARIPA DRLDDVVVLD PASRTPLGVN
PLAGADPDLA ADSIVAVFHS LYGDGLGPRS TDILHAAVLT LARRGDASLA MVPLLLTNPG
FRRSLVGRAH KEDPLGVGSF WAWFDSLSSG EQDMVTRPLL NKLRPIMLRP SLRGVFGQRR
PAISVEQILA ERKILLVELR KDQIGPEAAG LIGSHVVAHL WRAILGRLSL PAEQRHPFMV
YIDEVQDYLR LPGSLADALA QARGLGVSVT AAHQHLGQLG RLDADLEANT ATKLFFRLSP
GDGRHLASAV GAGQLTADDF TLQPDHQLYA RLLVNGRLLP WVSVATQPLP PPLHDPAQVQ
ARSEAHYGRA LDDVEADLLS LADATAREGS TEHPDEATPR TGFGRRRPSA PSRPSAEPHP
VTQKRGRI