Gene Namu_4400 details

Gene Information

Locus tag: Namu_4400
Symbol:
ID: 8450026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4883724
End bp: 4885409
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 74%
IMG OID: 645043447
Product: hypothetical protein
Protein accession: YP_003203676
Protein GI: 258654520
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
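
The coordinate and length fields above agree with each other, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4883724, 4885409)
print(length, protein_length_aa(length))  # 1686 561
```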


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGGG CTCGAATCGG TGCGTTCGCC CTGGCCGTGG CGGCCACCGG GACCCTGGCG 
GCGACCGTCG CGTCCCTACC CGCGCAGGCG ACCACCACGG CGTCCTCGGC GCAGGCCGTG
GGGACCCCGG TGACCAGCGC GAACGGCGAG GCGACGCTGA TCGTCGACCC GACCAGCGAC
CTCGCCGACA CCGGTGCCTC GATCAAGGTC CAGGGCAAGG GTTTCGGGAC CGATCCCGGC
GGCATGTACG TCGCGGTCTG CCGGGACGCC GGGGCCACCC CGAACCTGGA CCAGTGCGTC
GGCGGGCCCG TCCCGACCAA CCCGACCCCG GGGGCGTGGG CGCACATCGT GGCCAGCGGC
ACCGGGGTCA ACGTGGCCAA CTGGAACGGG GGCGGCTTCC AGGTGACCCT GGCCCTGCCC
TCGGTGGCCG GCGGGTCCGT CGATTGCGTC AAGTCGGCCT GCGCGCTGTA CACGGTCAGC
GACGACGGCA GCGAGCCATC GCTGGACAAC CGGATCCCGC TGGGCTTCAA GGCGCCGACG
TCGTCGGCGC CGTCCACGCC GACCAGCGCG ATCGTGCAGC AGGTCGGCTC GCCGACCATT
GCCCCCGGGG CGACCCAGTC GGTGATCTTC TCCGGCTTCA AGGGCGGTGA GCAGGTCAAC
CTGACCCTGT TCTCCGAGCC GGTCACCCTG TCGCCGGTCA CCGCCGATCC CACCGGGGTG
GCCCGGGTCG ACTTCGTGGT CCCGGCCGAC TTCGTGGACG GCGCGCACCG GTTGGAGGCG
ATCGGCGCCC AGTCCGGCAC GGTCGGGGTG GCCAGCTTCC AGGTGGTCGT GCCGACCCCC
ACCCCGAGCC CCACGCCGAC GCCGAGCCCG ACCCCCTCGC CCAGCCCGTC GCCGACGTCG
ACGAGCCAGA CCAGCAGCGC GGCGAGCTCC AGCAGCCTGG CCCCGACGAC CGCGGCGACC
ACGACCAGCA CCGACAGCGG GGGCGATTCC GGCGGCTCGA ACTGGTGGAT CTGGCTGATC
CTGGCCCTGG TCGTGCTGGC CGGGCTGATC ACCTGGTTCG TCGTCGACCG CCGGAACAAG
GAGGCGGCCC GGGCCGAGCA GGAACGGCAG CTGGCGGACG CCGCCAACCG GCAGCAGCCG
CCCTACGACC CGATGGCCGA CGCGCCGACG ATGATGATGC CGCCGGCCGA TCCGCGGCCC
AGCGGACCGC CGCCGGGGGC CGATCCCTAC GGCCTGCTCT CCGGGCGCAA CCACCCCTAC
GGCGTCGACC CGAACGCGCC GACCCGGTAC GACCCGCCGG CCGGTCCGAC CCAGTACATC
CCGCCCGATC CGGGGCAGTA CCAAAGCGAT CCGACCCAGG TCATCCCGCC GGGTCAGGCC
GGCCGGGGTC AGGGCGGTCC GGGTCAGGGC GGTCCGGGTC AGGGTGGTCC GGGTCGGGGT
CCGGCCCCCG ACTGGACGGT CCCGCCCGAG TTCTCGGGCG GCCCGAGCGA GCGTCCGGCC
GGCCCGCCGA CGACTCCGCA GCCGCAGTCG CAGCCGCCGG AGCAGGGTCC CCGGACGGCG
CAGTTCCGGC CGGACTTCAA CGATCCGAAC GGCCGCGATC CGAACAGCGA CGATCCGAAC
GGCCGCGACC CGAACAGCAG CGACGAGTCC GACGGCCCGA CCGACACCGG CCCGCGGTCC
CGCTGA
 
Protein sequence
MMRARIGAFA LAVAATGTLA ATVASLPAQA TTTASSAQAV GTPVTSANGE ATLIVDPTSD 
LADTGASIKV QGKGFGTDPG GMYVAVCRDA GATPNLDQCV GGPVPTNPTP GAWAHIVASG
TGVNVANWNG GGFQVTLALP SVAGGSVDCV KSACALYTVS DDGSEPSLDN RIPLGFKAPT
SSAPSTPTSA IVQQVGSPTI APGATQSVIF SGFKGGEQVN LTLFSEPVTL SPVTADPTGV
ARVDFVVPAD FVDGAHRLEA IGAQSGTVGV ASFQVVVPTP TPSPTPTPSP TPSPSPSPTS
TSQTSSAASS SSLAPTTAAT TTSTDSGGDS GGSNWWIWLI LALVVLAGLI TWFVVDRRNK
EAARAEQERQ LADAANRQQP PYDPMADAPT MMMPPADPRP SGPPPGADPY GLLSGRNHPY
GVDPNAPTRY DPPAGPTQYI PPDPGQYQSD PTQVIPPGQA GRGQGGPGQG GPGQGGPGRG
PAPDWTVPPE FSGGPSERPA GPPTTPQPQS QPPEQGPRTA QFRPDFNDPN GRDPNSDDPN
GRDPNSSDES DGPTDTGPRS R
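
The sequences above are printed in space-separated blocks of ten characters, so they have to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the example uses only the first displayed line of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied as displayed (blocks of 10 bases).
first_line = "ATGATGCGGG CTCGAATCGG TGCGTTCGCC CTGGCCGTGG CGGCCACCGG GACCCTGGCG"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Passing the full gene sequence block to `clean_sequence` in the same way would allow the 1686 bp length and the reported GC content to be checked.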