Gene Namu_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3066 
Symbol 
ID8448680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3377932 
End bp3379290 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content73% 
IMG OID645042148 
ProductCupin 4 family protein 
Protein accessionYP_003202389 
Protein GI258653233 
COG category[S] Function unknown 
COG ID[COG2850] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0000118517 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00228521 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAGCCC TTGATTCGAC GGGGGCGTCC GGCTCGGCCG GGCGCCCCCC GCGTCACTGG 
TCACCCGTGC AGCGGTGCAT CGCCATCGAC GCGGACGATT TCGCCCAGCG CTACTGGGCG
CAGGCCCCCC TGCTGACCAC GGCGGCCGAG CTGAACGACG ACTTCAGCGA CCTGTTCTCG
GCCGACTCCG TCGACGAACT GGTCTCCGAG CGAGGGCTGC GTACCCCGTT CCTGCGGATG
GCCAAGAACG GCTCGGTGCT CTCCAGCGCG AGCTTCACCC GCGGCGGCGG CGCCGGCGCG
ACCATCACCG ACCAGGTCGC CGACGACAAG GTGCTGGCCC AACTGGCCGG CGGGGCGACG
CTGGTGCTGC AAGCACTGCA CCGCACCTGG CCGCCACTGG TCCGGTTCGG CAGCGAGCTG
GCCGCCGAGC TCGGGCACCC GGTCCAGATC AACGCCTACA TCACGCCGCC GCAGAATCAA
GGCTTCGCGT CGCACTACGA CACCCACGAT GTGTTCGTCC TGCAGATCGC CGGCACCAAG
CACTGGCGCA TCCACGAGCC GGTGCTGCCC GATCCGCTCC CGCACCAGAC GTGGGACGGG
CGCCGGGCCC AGGTGCAGGA CCGCGCGGCC CAGGCGCCGG CCATCGACGC CCTGCTGCGA
CCGGGCGACG CGCTGTACCT GCCCCGCGGC TACCTGCATT CCGCCGTCGC CCAGGGTGAG
CTGTCGATCC ACCTGACCAT CGGCGTGCAC CCGCTGACCG GTTACGACCT GGCCCGCGAA
CTGATCGCCG CCGCCGAGGA CGACCCCGAG CTGCGCCGAT CCCTGCCCAT GGGCGTGGAC
GTCACCGACG TCGACGCCAT GGCCACCCAC CTGCGCCAGG CGGCGCAACG GCTGGTCGAC
CGGCTCGGTC AGGCCGGGCC CGAGCTCTAC CGGGCCGCGG CCCGACGGGT CGGGCCCCAG
CAGGTCGGGC AGACCCGGCC GGCGCCGATC GCGCCGCTGG CCCAGCTGCG CGCGGCCGCG
ACGCTGGACC CGCAGACCCC GCTGGTGCTG CGCCCCGGCC TGCGGCCGCG GCTGCGGCAA
CAGGGTGAGA AATGGGTGCT CAGCCTGATC GACTCCACCG TCAGCTGGCC CGAACAGGTG
CACGCCGCGC TGCTGATCGT GTTGTCCGGC AAGGCATTCA CCGCCGACGA GCTGCCGAAC
CTGGACGACG CCGAGCAGCT GGTGGTCGCC CGCCGGTTGC TGCGCGAAGG CATCGTGATC
CCGGCCGTGA TTCCGGCTGT GATCCCGACT GTGATCCCGG CTGTGGCCCC CGGCCCGGAC
GGCGGCGCGG CCGGACCGGC GACCGACGAC GGTGGCTGA
 
Protein sequence
MPALDSTGAS GSAGRPPRHW SPVQRCIAID ADDFAQRYWA QAPLLTTAAE LNDDFSDLFS 
ADSVDELVSE RGLRTPFLRM AKNGSVLSSA SFTRGGGAGA TITDQVADDK VLAQLAGGAT
LVLQALHRTW PPLVRFGSEL AAELGHPVQI NAYITPPQNQ GFASHYDTHD VFVLQIAGTK
HWRIHEPVLP DPLPHQTWDG RRAQVQDRAA QAPAIDALLR PGDALYLPRG YLHSAVAQGE
LSIHLTIGVH PLTGYDLARE LIAAAEDDPE LRRSLPMGVD VTDVDAMATH LRQAAQRLVD
RLGQAGPELY RAAARRVGPQ QVGQTRPAPI APLAQLRAAA TLDPQTPLVL RPGLRPRLRQ
QGEKWVLSLI DSTVSWPEQV HAALLIVLSG KAFTADELPN LDDAEQLVVA RRLLREGIVI
PAVIPAVIPT VIPAVAPGPD GGAAGPATDD GG