Gene Namu_3522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3522 
Symbol 
ID8449141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3868031 
End bp3869389 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content73% 
IMG OID645042600 
Producthypothetical protein 
Protein accessionYP_003202836 
Protein GI258653680 
COG category[S] Function unknown 
COG ID[COG4325] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00487246 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.144649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGCAG GCCGACGCGG GAGCTGGAGC GTCCTTCGCG ATGCCGTCCG CACCCAGCTG 
TGGCCGTTCC CGGCGCTGGC CATCACGATC GCCGTCATCG CCGGCGTGCT GCTGCCCAGG
CTGGACGCGC AGATCGACGA CCAGCTCCCG CCGCAGATCA CCTACTACCT GTTCGGCGGC
GGCGCGGACG CCGCCCGCGC GGTGCTGGCC GGCATCGCCG GATCCCTGAT CACCGTGACG
TCCCTGACCT TTTCGCTCAC GGTGGTCACC CTGCAGCTGG CCAGCAGCCA GTTCTCGCCG
CGGTTGCTGC GCACGTTCAG CAGTGATCGG TTCGTCCAGC TCACCCTCGG ACTGTTCCTG
GGCACGTTCG CGTACGCGCT GACGGTGCTG CGCACGGTCC GCACCGCCGC CGACGACCAG
GCCCTGTTCG TGCCCCAGAT CGCCGTCACC GTGGCCTTCG TGCTGGCGGT GGCCAGCGTC
TTCGCCCTGG TGATCTTCCT GGCCCACCTG GCCCGGGAGA TCCGGGTCGA GACGATGCTG
GCCACCGTGC ACGGGGACGC CACCACCACG CTGCGCCGGG TCCTGCCCGA GCACGACCCC
GACCGGACAC CCGTCGCCGG GCCCGACATC CCGGCGGACG CCGAGATGGT GCCGGCGCCG
TCCTCGGGCT TTTTGGTCTG GCTCGACGAG CCCGGCCTGC TGCGGGCGGC CGTGCAGGCC
GACGCGATCG TGACCATTAC CGAGCAGCCG GGCAGTTCCC TGATCGAAGG GGTGCCGGCC
GGCGCGTGCT GGCCCCGCGG CGGTGGGCGC TTCGAGCCGG ACGTTCTGCG CACGCTGCGC
GAGCGGGTCG CCCCGACCAT CCACACCGGA CCCGAACGCA CCGCCGCCCA GGACGCCGCC
TTCGGGCTTC GCCAGCTCGC CGACGTCACC GTCAAGGCCC TGTCCCCCGG CATCAACGAT
CCGACCACGG CGATCCACGC GCTCGGGCAC CTGTCCGCGT TGCTGGGCGA GCTGGCCGCC
CGCGACCTGG GCCCGCACGT ACTCACCGAC GACGGCGGCC AGCTGCGGGT GGTCCTGGCC
CGGCCCACCT TCGCCGAGCT GCTGGAACTC GCGGTGGCCC CCACCCGCCG GTACGGCGCC
GCCGACCCGG ACGTGCTCGC CCGGCTCTTC CAGCTGCTGC GCGAGATCGC CTGGACCGCA
CCCGACGCCG AGCATCACCG GGCCATCGCC GGCCAGCTCG AGCGGCTGCG GACCACGGCC
CAGGCCCAGT CGTTCGACCC GACCGAGCGG GCCTACCTCA GCCGGCTGGC CGATCAGGTC
GACCAGACCC TGGACGGCCG CTGGGTGCTG CGAACCTGA
 
Protein sequence
MRAGRRGSWS VLRDAVRTQL WPFPALAITI AVIAGVLLPR LDAQIDDQLP PQITYYLFGG 
GADAARAVLA GIAGSLITVT SLTFSLTVVT LQLASSQFSP RLLRTFSSDR FVQLTLGLFL
GTFAYALTVL RTVRTAADDQ ALFVPQIAVT VAFVLAVASV FALVIFLAHL AREIRVETML
ATVHGDATTT LRRVLPEHDP DRTPVAGPDI PADAEMVPAP SSGFLVWLDE PGLLRAAVQA
DAIVTITEQP GSSLIEGVPA GACWPRGGGR FEPDVLRTLR ERVAPTIHTG PERTAAQDAA
FGLRQLADVT VKALSPGIND PTTAIHALGH LSALLGELAA RDLGPHVLTD DGGQLRVVLA
RPTFAELLEL AVAPTRRYGA ADPDVLARLF QLLREIAWTA PDAEHHRAIA GQLERLRTTA
QAQSFDPTER AYLSRLADQV DQTLDGRWVL RT