Gene Namu_3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3985 
Symbol 
ID8449604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4399753 
End bp4401774 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content74% 
IMG OID645043030 
Productprotein of unknown function DUF349 
Protein accessionYP_003203266 
Protein GI258654110 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0275004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0113989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACG ACGTCACGAC CACCGAAGGA CATTCTTCCG GAGAGCCGGC CGGCGCGGAC 
CGCCCGGACC CCACGCTCAG CGTGCCGGAG ACCCCGGATT CGCCGGACAT CCCCACCCCG
GAACCGGGCG CCCCGGCGCC GGCTCCGGAG GTGCCGGACC TGCCCGAACT CCCCGACCTA
CCGGACCTTC CCGACGCTCC CGCGCCCCCG TCGACCACGA CGCCCACCGA CGACGCCGCA
CCCGCGGCCG GGACGGACGA GGGCTCGGCC GCGGGTGCGG ACGAGCTCGC CCACGCCGGA
TCGAGCGGGC CCGTCCCGAT GCCGACGGTC GCCCCACGCA CGCCACCGGT CGCGATCCCG
ACCGTGGCGC CGTCGATCGA ACCCGCGCCG ATGCCCTCCC CGATCACGAC CAGCACCCCG
GCCGATGCGC CCGACCAGCC GACCGTCGAG CCCGCGACGG ATCAACCGGC ACCGGCCGAC
CAGGCCGACG CTGCACCGGC CGAGCAGACC GACGCTGCAC CGGCCGACCA GGCCGACGCT
GCACCGGCCG ACCAATCCGA CGACGCCGCG CCCGAGCCGG CGGCCGCTCC ACACCCGGAC
GCCGACCCGG CCACCAGCAC GCCGGCTCCC GCGACCCCCG GGCAGCGGCC GGGCCAGCGT
GGTCCTCGTC CCCGCGGCGG GCGTCCCGGG TCGCCGGGGT CGCGCCCCGG TGGCCGGCCG
GGTGGCCCGG GCGGCGGCCG GCCCAGCCCG GTGCCGGCGA CGCCCACGCA CGCCCCGGCG
CACTCGACGG TGGTCGAGCC GGTGGTCGAC AGCGTCGACC CGCACGAGTG GGGCCGCATC
GACGACGACG GTGTCGTCTA CGTCCGCACG GCCGCCGGCG AGCGCGCGAT CGGCAACTGG
CAGGCCGGTG ACGCGGAGGC CGGGCTGGCC CACTTCGGCC GCAAGTTCGA CGACTTCAAC
ACCGAGATCG CTTTGCTGGA GGCGCGTCTG GCCTCCGGCA CGGGTGATCC CAAGGCCACG
AAGGCGCAGG CCATCGCGCT GCGCGACCAG GTCGAGTCGC TGTCGGCGAT CGGTGATCTG
GACAACGCGG CGGTCCGGCT GGAGATCGTG ATCGGCGTCG CCGACGCCGC CATCGCCGGC
GCCTCCACCG CCCGGATCGC GGCCCGCAAC GCCGCGATCA AGGCCAAGGA GGACCTGTGC
GCGGAGGCGG AGGCCCTGGC CGAGTCGACC CAGTGGAAGT CCACCGGCGA CCGGCTCAAG
GCGATCGTGG ACGAGTGGCG CACCATCCGC GGCATCGACC GCAAGACCGA TGACGCGCTG
TGGAAGCGGT TCGCCAAGGC CCGGGACACC TTCACCCGGC GCCGGGGCTC CCACTTCGCC
GAACTGGACA AGCAGCGGGG CGCCGCCCGG GAGGCCAAGG AAGAGCTGAT CAAGCGGGCC
GAGGCGCTCT CGGACGCCAG CGACTGGGGC GAGACGGCGG CCAAGTACCG CGCCCTGATG
GAGGAGTGGA AGGCCACCGG CCGGGCTCCG CGCGACGTCG AGGACGCGCT GTGGGCGCGT
TTCCGGGCCG CGCAGGAGAA GTTCTTCTCC CGCCGGAACA AGGTGTTCTC CGACCGGGAC
GCCGAGTTCG CGGCCAACGC CGCGACCAAG GAAAAGCTGC TCACCGAGGC CGAGAAGATT
GACCCGGCAG CCGGTCTGGA CCAGGCCAAG GCCAAGATGC GCTCGATCCA CGAGCGCTGG
GAGGCGGCCG GCAAGGTCCC CCGGGAACGG ATCCGGGACC TCGATCAGCG CCTGAAGACG
ATCGAGGACC GCATCAAGGC GGCCGAGGAT CGGCAGTGGC GGCGCACCGA CCCGGAGACC
GACGCGCGGG TGGCGCAGTT CCGGGCCCGG GTCGAGTCGT TCCAGGCCCA GGCGGCCAAG
GCCCGCGCGG CCGGGGACGA GCGCAAGGCC AAGCAGGCCG AGGCCCAGGC CAAGCAGTGG
GAGGAATGGC TCAAGACCGC GCAGAGCGCG GTCGACCGTT AG
 
Protein sequence
MPDDVTTTEG HSSGEPAGAD RPDPTLSVPE TPDSPDIPTP EPGAPAPAPE VPDLPELPDL 
PDLPDAPAPP STTTPTDDAA PAAGTDEGSA AGADELAHAG SSGPVPMPTV APRTPPVAIP
TVAPSIEPAP MPSPITTSTP ADAPDQPTVE PATDQPAPAD QADAAPAEQT DAAPADQADA
APADQSDDAA PEPAAAPHPD ADPATSTPAP ATPGQRPGQR GPRPRGGRPG SPGSRPGGRP
GGPGGGRPSP VPATPTHAPA HSTVVEPVVD SVDPHEWGRI DDDGVVYVRT AAGERAIGNW
QAGDAEAGLA HFGRKFDDFN TEIALLEARL ASGTGDPKAT KAQAIALRDQ VESLSAIGDL
DNAAVRLEIV IGVADAAIAG ASTARIAARN AAIKAKEDLC AEAEALAEST QWKSTGDRLK
AIVDEWRTIR GIDRKTDDAL WKRFAKARDT FTRRRGSHFA ELDKQRGAAR EAKEELIKRA
EALSDASDWG ETAAKYRALM EEWKATGRAP RDVEDALWAR FRAAQEKFFS RRNKVFSDRD
AEFAANAATK EKLLTEAEKI DPAAGLDQAK AKMRSIHERW EAAGKVPRER IRDLDQRLKT
IEDRIKAAED RQWRRTDPET DARVAQFRAR VESFQAQAAK ARAAGDERKA KQAEAQAKQW
EEWLKTAQSA VDR