Gene Namu_1051 details

Gene Information

Locus tag: Namu_1051
Symbol:
ID: 8446647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1161125
End bp: 1162909
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 70%
IMG OID: 645040189
Product: Rhs element Vgr protein
Protein accession: YP_003200448
Protein GI: 258651292
COG category: [S] Function unknown
COG ID: [COG3501] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01646] Rhs element Vgr protein
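
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1161125
END_BP = 1162909
GENE_LENGTH_BP = 1785
PROTEIN_LENGTH_AA = 594


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1785 bp span corresponds to 595 codons, of which 594 encode amino acids and one is the stop codon.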


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATGG CCCATCCCCT GTCCATCAAC CGAACGGCCC CGGTGCTGCA GGTCAACGGG 
GCGCCGCTGG CGGCGAGCGC GCTGGCCGCG TTGATGGAGC TGCGCATCAC CCGGGGCCTG
CGCTTGCCCG GTCGGGCCAC GCTGCGCTTC GACGATGCCG GCCACGCCCT GGCCGCCGGC
GGCACTTTCG CCATCGGCGC CACCGTCTCG GTCGCCGCCG GCTCCGGGCA GGTGCTGCTC
TCCGGTGAGG TCACCGGGGT CGACCTGAGT ATCGAGTACG GGCAGCCCGA GTTCACCGTC
GTCGTCGACG ACCTGTCCTA CAAGCTCACC CTGGGCGCCA AGATCCGCAC GTTCGCCGCC
ATCACCTACG GCGAGGTGAT CGGCCAGATC TGCCGGGAGC AGGGCCTGAG CTTCGAGAAC
CGCGGCGGCA CCGACGCGTT GACCGTGCAG CAGGACTACC TGATGCAGTC CGACACCGAC
TTCGGCTTCC TGACCGAGAT CGCCGACCGG ACCGGCAACG ACTGGTGGGT GGAGGAATCG
ACGCTGGTGC TGGCGGCGCC GACGGACGCG CAGCCGGTCG CCTCCCTCGG CTTCGGCGAC
GACCCGACGC TGACCGCGTT CACCGTCCGG GCCAGCGCCC TGCACCCGGC CGAGCCGGTG
GTGCACGGCT GGTGGCCGCA GACTAAGCAG GCGGTCACCG CGCAGGGGCG GGCGGCCAGC
ACGTCCAGCG TGCCGACGCT GGTCGAGCCG TTCATCCAGG CCAGCGACCT TTCCAGCCGG
GCCAAGACGG TGACGGCCAG CGAGGTGCCG CTGGACCAGT CCGACGCGGA GGTGCTGGCC
AACCGGCTGG CCGGCCGCTG GAGCTCGGGG GCGGTGACCG CCCGCGGCAC CACGGTGCAG
GTCGAACCAG CCGTGGTGCC GGGATCCACG GTGGCGGTGA CCGGTTCGGG TCCGGCGTCG
GGCAATTACC ACGTCACCGA GATCGAACAC GTCTACAACC GCCGGGGTTT CAGCACCCGC
TTCACCGCCG GCGACCGCCG GCCGTCCTCC CTGGTCGACG CGCTGTCCAC CCAGGCCTCG
TCGAGCTTTC GCCGGCAGGG CATCGTCATC GGGGTGGTCA CCAAGGTCGG CAACCCGAAC
GGCTCCCCCG GCGAGGTCAA GGTCGCCTAC AAGTCGGCCG GGGACCAGGT CGAGTCGAAC
TGGGCCCGGG TGGTCACCGT GGGCGCCGGC AACGGCCGGG GCGCGACGTT CATCCCGGAG
ATCAACGACG AGGTGATCGT CGGGTTCGAG GGCGGCGACA GCCGCCGGCC GATCGTGCTC
GGCGGCGTGT ACAACGGTCA GGACGTGCCG GTGGAGTTCG GGGTGGCCAA CAGCAAGGTC
AACAAGCGCC GGATCACCTC CCGGGCCGGG CATTTCCTGG AGTTCGGGGA CGGCGACGCG
ACCGCCGACC AGCACATCAG CTTCACCCTG GCCGGCGCGG AGCACCAGAT CGTGCTGGGC
AAGGAGAAGT TCGAGGCGAC GGTGCCCTCG GGCAAACCGA TGACGATCAA GTCCGGCAAC
TCCAGCATCG CCATCGGCGC CGACGGATCG ATCACCATCC AAGGCAAGAA GATCACCCTC
AAGGCCGACA CCGACGTCGA GATCTCCGGG GTCAACGTGA CCACCAAGGC CAGCGTGAAG
GCCGAGACCT CGGCGACCCA GGTCGCGATC AAGGCCTCGG CCACCGGCGA GGTCTCCTCC
GGCGGCATGA TGTCGGTCAA GGGCGCCATG GTGGCGGTGA ACTGA
 
Protein sequence
MTMAHPLSIN RTAPVLQVNG APLAASALAA LMELRITRGL RLPGRATLRF DDAGHALAAG 
GTFAIGATVS VAAGSGQVLL SGEVTGVDLS IEYGQPEFTV VVDDLSYKLT LGAKIRTFAA
ITYGEVIGQI CREQGLSFEN RGGTDALTVQ QDYLMQSDTD FGFLTEIADR TGNDWWVEES
TLVLAAPTDA QPVASLGFGD DPTLTAFTVR ASALHPAEPV VHGWWPQTKQ AVTAQGRAAS
TSSVPTLVEP FIQASDLSSR AKTVTASEVP LDQSDAEVLA NRLAGRWSSG AVTARGTTVQ
VEPAVVPGST VAVTGSGPAS GNYHVTEIEH VYNRRGFSTR FTAGDRRPSS LVDALSTQAS
SSFRRQGIVI GVVTKVGNPN GSPGEVKVAY KSAGDQVESN WARVVTVGAG NGRGATFIPE
INDEVIVGFE GGDSRRPIVL GGVYNGQDVP VEFGVANSKV NKRRITSRAG HFLEFGDGDA
TADQHISFTL AGAEHQIVLG KEKFEATVPS GKPMTIKSGN SSIAIGADGS ITIQGKKITL
KADTDVEISG VNVTTKASVK AETSATQVAI KASATGEVSS GGMMSVKGAM VAVN
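
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, assuming the codon assignments of NCBI translation table 11 (which, for elongation, match the standard code); it ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced only for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied from this record.
    first_block = "ATGACGATGG CCCATCCCCT GTCCATCAAC"
    print(translate(first_block))  # MTMAHPLSIN
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the protein sequence and the GC content field to be compared against the values listed in this record.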