Gene Namu_4195 details


Gene Information

Locus tag: Namu_4195
Symbol:
ID: 8449821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4634361
End bp: 4636256
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 63%
IMG OID: 645043244
Product: Rhamnan synthesis F
Protein accession: YP_003203473
Protein GI: 258654317
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3754] Lipopolysaccharide biosynthesis protein
TIGRFAM ID:
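
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state, and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4634361, 4636256)
print(length, protein_length_aa(length))  # 1896 631
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.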


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGCGGCGCG TCATCTTCTA TCTGTTCTAC GATGCCCAGG GCATCGTCGA TGACTACGTT 
CCCTACAAAC TCAACGCACT GCGCCCGTTC GCGGACCACA TCTTCGTCGT TTCCAATTCC
ACCCTCACGC CGGAGGGGCG GGAAAAACTC GCCGACGTCG CCGACACCGT CCTGGCCCGC
GAGAACGTCG GCTTTGACGT CTGGGGTTAC AAGGAGGCGA TGGAGGCCTT CGGTCGCGAC
CGGCTCGCCG AATACGACGA GCTCATCCTG ATGAATTACA CGTTCTTCGG CCCGATCTTT
CCGTTCGCCG AGACGTTCAC GGCGATGGAC GCCCGGGAGG ACATCGATTT CTGGGGCCTG
ACCGCGCACG GAGAAGTGGA CCCCAACCCG TTCCCGGACA CCACCGGAAC CCTTCCCCTG
CACATCCAGT CCCACTGGAT CGCCGTTCGC AAGACGATGT TCACCTCGAT CGAGTTCGCC
TGGTACTGGG ACAAGATGCC CATGGTGACC TCGTACACGG ATTCAATCCT GCAGCACGAG
TCAAAATTCA CTCAGCACTT TGCCGATCGC GGCTTCCGGT ATTCGATCCT GTTCGATCCG
AGCCGATATC CGACGACGCA TCCGGTCTTC GACAGCGCCG ATCTGATGCT GGGCGACCGG
TGCCCGATCC TCAAGCGGCG CATGTTCTTC CACGAGCCTA CCTATCTCGA GCGCAATGCC
ATTCTCGGTC GCCGGGTCAT GGAGATCGTG TCCCGCACGG ACTATCCGGT GGATCTCATC
TGGCGCAATG TCGTGCGCTC GGCCGAGCCG CGCACGCTGT ACACCAACAT GTCGATGCTG
TCGGTCGTGC CCGATGTCGA CACCGGATTC CGGCCCGATC CGCCGCTGCG GATCTGTGTG
CTGGCCCATA TCTTCTATGA AGACATGACC GACGAGATGA TGGGCTGGAT CGGAAATATT
CCCGTCCCCT TCGATCTGGT CGTCACGACG ACGAGCGCCG CCAAGAAGGA GGCCATCGAG
TCCGCCCTGG AGGCGTACGC ACTGAAATCG GTCGAGGTGC GGCTCGTCGA GAGCAATCGG
GGGCGCGCGG AAAGCGCCTT CCTCATCGCC TGCCGCGACG TGCTGACCTC CGGCGAGTAC
GACCTGGTCC TCAAGATCCA TTCGAAGAAG TCCCCGCAGA ACGGCGCCAA TCTCGGGCAG
TTGTTCAAGC ACCACTCGGT GGACAACCTG CTGTCCTCAC CGGGCTACGT GGCGTCCATC
CTGGGCATGT TCCAGAGTCA GCCCAGCCTC GGCATGGTCT TCCCGCCGGT GGTCAACATC
GGATTCCCGA CCCTGGGACA CTCCTGGTTC ACCAACCGCG AGGCCGCCCA CGAGCTGGCC
GACCAGCTGG GCATCCACAC GATCTTCGAC CGGACCACGC CGCTGGCCCC CAACGGCACC
ATGTTCTGGG CGCGACCGGA GTCCCTGGCC AAGCTGGCCA GGCATGACTT CGACTATTCA
CAGTTCGCGG CCGAGCACGA GGGCTGGTCG GACGGCATGC TCGGGCACGT CATCGAACGA
CTGTACGGTT ATGCCGTCCT GGACGCGGGA CTCAGGATCC AGTGCGTGTT CAACACCGAC
TGGGCATCGA TCAACTACGT CTTCCTGGAA TACAAGCTGC AGCGCATCCT GTCGATGCTG
CCGGCCCACA CCCAGGAGGC GGTGGACTAC CTGGAGCGCG CCCGGGCCGC GCTCGAGAGC
CCGCCGCCGC CGCCCCCGCC GGAGGAACCG CCGCTGGCCC TGCTCAAGCA CTCGGTCGAT
CGGTCGTACC CGCGGTTCGG CCGCTTCATG CGGCCCTTTT ACCACGCCGC TCGCGCCACT
GTGCGGACCG GTCGGCGCAT GCGGAAGGCG CGATGA
 
Protein sequence
MRRVIFYLFY DAQGIVDDYV PYKLNALRPF ADHIFVVSNS TLTPEGREKL ADVADTVLAR 
ENVGFDVWGY KEAMEAFGRD RLAEYDELIL MNYTFFGPIF PFAETFTAMD AREDIDFWGL
TAHGEVDPNP FPDTTGTLPL HIQSHWIAVR KTMFTSIEFA WYWDKMPMVT SYTDSILQHE
SKFTQHFADR GFRYSILFDP SRYPTTHPVF DSADLMLGDR CPILKRRMFF HEPTYLERNA
ILGRRVMEIV SRTDYPVDLI WRNVVRSAEP RTLYTNMSML SVVPDVDTGF RPDPPLRICV
LAHIFYEDMT DEMMGWIGNI PVPFDLVVTT TSAAKKEAIE SALEAYALKS VEVRLVESNR
GRAESAFLIA CRDVLTSGEY DLVLKIHSKK SPQNGANLGQ LFKHHSVDNL LSSPGYVASI
LGMFQSQPSL GMVFPPVVNI GFPTLGHSWF TNREAAHELA DQLGIHTIFD RTTPLAPNGT
MFWARPESLA KLARHDFDYS QFAAEHEGWS DGMLGHVIER LYGYAVLDAG LRIQCVFNTD
WASINYVFLE YKLQRILSML PAHTQEAVDY LERARAALES PPPPPPPEEP PLALLKHSVD
RSYPRFGRFM RPFYHAARAT VRTGRRMRKA R
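
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal Python sketch, not the database's own tooling, that strips the block formatting, computes GC content, and translates a CDS. It assumes the amino-acid assignments of translation table 11 (identical to the standard code) and treats only ATG, GTG, and TTG as initiation codons that are read as Met; that start-codon set is a simplifying assumption, and all names here are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiating start codon as Met."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop the terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


first_blocks = clean("GTGCGGCGCG TCATCTTCTA TCTGTTCTAC")
print(translate_cds(first_blocks))  # MRRVIFYLFY
```

The first three blocks of the gene sequence translate to the first ten residues of the listed protein, with the GTG start codon read as Met. Passing the full cleaned gene sequence to `gc_content` gives a value that can be compared with the GC content field.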