Gene Namu_5290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5290 
Symbol 
ID8450923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5909250 
End bp5910509 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content77% 
IMG OID645044323 
ProductVWA containing CoxE family protein 
Protein accessionYP_003204545 
Protein GI258655389 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGAGC CGGACCTGAG CCTGGGCGCC GATCTGGCCA CGGTCGGCGC GCTGCTGGCC 
GACCGGCTGG CCCGTCACGG GGTGCCGGTG CCGGTGCACC GGGCGGTCTG GTGGACCCGG
GTGGTGATGG CCGGCGGGCC GACCACCGTG GACGAGTTGT ACTGGCTGTC CCGGGTCAGC
CTGATCGACC GGCACGAGCA CCTGCCGACC TTCGACGCGG TGTTCGCCGG ACTGTTCGGC
GCCGGCACGG CCGCGCCGCC GGACCCGGCC GCGTTCCGGG GCGATCAGAA CAACGAGCTG
CCCCCGGCCG CGGCCGCCGG GTCACCGTCG CCGCCCCAGG CCCCGGACGC CCCGCCGCCT
CCGACGCGTG ACGAGCAGCC CACGGTGCAG CAGGTGGGCG ACGACGAGGA CACCGGGGCC
GAGCCCGAGG AGGACCAGTC GCCGGGCGTG GCCGCCGTGG CCTCGTCGAT CGAGCTGCTG
CTGGCCAAGG ACTTCGCCGA CTGCGACGCG GACGAGATCG CCGAGCTGAA CCGGATCGTG
GCCCGGATGC GGATCGTCGC GCCCACCCGG CCGGCCCGCT GGAAACCCAC GCTGGGGCCG
GGCCGGTCGG TGGATCTGCG GCGGACGCTG CGCCGGGCCA GCCGCACCGG CGGGGACCCG
GTGCGCTGGG TCCGCCGGCG CCGCAGCGCC GTGCCCCGGC GGGTCGTGCT GCTGGCCGAC
GTGTCCGGGT CGATGCAGTC CTACGCCCGG GTCTACCTGC GGGTGCTGCA GGGGGCCGCG
CTCGGCGCCC GGGCCCACGC CTACCTGTTC GCCACCCGGC TGCATCCGGT CACCCGGGCA
TTGGTCCGCG GCCCCCGCGA GGGCGGCATC ACCCGGGCCA TGGCCCAGTC GCCGGACGCC
TCCGGCGGCA CCCGGATCGG CGCGGCGATC AAGGAATTCC TGGACACCGA CGGCCGCCGG
GGCCTGGCCC GCGGCGCGGT GGTCGTGGTG GTCTCCGACG GCTGGGAGCG GGCCGATCCG
GCCCTGCTGG GCGAGCAGAT GGCCCGGCTG CACCGGCTGG CCCACTCGGT GATCTGGGTC
AATCCGCGCA AGGCCGCCCC CGGCTTCGCC CCGCTGGCCG GCGGGATGGC CGCCGCGCTC
CCGCACGTCG ACCGGTTCAT CGAGGGGCAC TCGGCGCGGT CGGTGCAGGG ACTGCTGGAC
GCGATCGCCG ACAGCACCGG CGGGCCGGTC GGCGCCGCCC GCCCCGGCCG GCCGGGCTGA
 
Protein sequence
MAEPDLSLGA DLATVGALLA DRLARHGVPV PVHRAVWWTR VVMAGGPTTV DELYWLSRVS 
LIDRHEHLPT FDAVFAGLFG AGTAAPPDPA AFRGDQNNEL PPAAAAGSPS PPQAPDAPPP
PTRDEQPTVQ QVGDDEDTGA EPEEDQSPGV AAVASSIELL LAKDFADCDA DEIAELNRIV
ARMRIVAPTR PARWKPTLGP GRSVDLRRTL RRASRTGGDP VRWVRRRRSA VPRRVVLLAD
VSGSMQSYAR VYLRVLQGAA LGARAHAYLF ATRLHPVTRA LVRGPREGGI TRAMAQSPDA
SGGTRIGAAI KEFLDTDGRR GLARGAVVVV VSDGWERADP ALLGEQMARL HRLAHSVIWV
NPRKAAPGFA PLAGGMAAAL PHVDRFIEGH SARSVQGLLD AIADSTGGPV GAARPGRPG