Gene Namu_2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2284 
Symbol 
ID8447895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2519020 
End bp2520240 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content72% 
IMG OID645041406 
Productprotein of unknown function UPF0118 
Protein accessionYP_003201650 
Protein GI258652494 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00112442 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00386241 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCAGGA CGAAATCCGG GCAGCCGAGC GAGCCGGTCG CGCACCATCC GCGGGTCGGA 
CAGCCGCACG ACGCCGCCGC GGAGGGTCCG ATTGCCCAGG CCGAGGCGGT CGCCGCCACG
CTGCGCACGC CGGCCGACCC GCTCGGGCCG CTGGGCCGCC GGCTGAACTG GCGCTCGGCC
TTCCTGATCG GGCTGGCCGC CACCGCCGGG GTCGCGGTGA CGGTCGGGAT CATCCAGATG
CTGCTGCTGG CCGGTCAGGT GCTGCTGCTG ATCGCGCTGG CCCTGTTCCT GGCCATCGGC
CTGGAACCGG CCGTGTCCTG GCTGATCACC CACCGTTTCC CGCGCTGGGC CGCGGTGTTC
ACCGTGCTGG CCGGGCTGAC CCTGCTGCTG GCCGGGTTCA TCGCCGCCGC CATCCCGGCC
CTGGTCGAGC AGGGTCGGCG GCTGGTCGAG GCGGCCCCGC AGTACGTGGC CCAACTCAGC
GACGACAGCT CGGCCATCGG CCAGCTCAAC CAGCGCTTCC ACCTGCAGGA GACGGTGCAG
CAGATCGTCG ACGGCGGCGG CCCGGGCCTG GCCTCCGGGG TGATCAGCGT CGGCGAGGCC
GTCTTCGGTG CGTTCTCCGG TCTGCTGGTG GTGGCCGTGC TCACGGTCTA CTTCCTGGCC
GACATGCCCC GCGTGCGCAC CACCCTGTAC CGGTTCATGC CGGCGCCCCG GCGGCCCCGG
GCGATCCTGC TGGGGGACCA GATCATGGTC AAGGTCGGCG GGTACGTGCT GGGCAACGTG
GTCATCTCGG TGATCTCGGC GGTGGTCACC TTCGTCTGGC TGATCGCATT CGGCGTGCCC
TACCCGCTGT TGCTGGCCAT CCTGTTCGCC CTGCTCGACC TGATCCCGGT GATCGGATCG
CTGATCGCCG GAGCCCTGGT CGCCCTGGCC GCGTTCAGCG TGTCGGTGCC GGTCGGGCTG
GCCACCATCG GCTACTTCGT GGCCTACAAG CTGGTCGAGG ACTACCTGCT CACGCCAAAG
GTGTTCGGCC GGGTGCTGCG GATGCCCGCG CTGGTCACCG TCTGCGCGAT CCTCATCGGC
GGCGCCCTGC TCGGGCTGGT CGGCGCGCTC GTCGCGCTGC CCACGGCGGC CGCGATCATG
CTGCTGGTGC AGGAGGTGGT GTTCCCGCGC CTGGATCGCG CCCATGCCAG CGGCGAGCCG
GCAGCTGACC CGGCGGCCTG A
 
Protein sequence
MTRTKSGQPS EPVAHHPRVG QPHDAAAEGP IAQAEAVAAT LRTPADPLGP LGRRLNWRSA 
FLIGLAATAG VAVTVGIIQM LLLAGQVLLL IALALFLAIG LEPAVSWLIT HRFPRWAAVF
TVLAGLTLLL AGFIAAAIPA LVEQGRRLVE AAPQYVAQLS DDSSAIGQLN QRFHLQETVQ
QIVDGGGPGL ASGVISVGEA VFGAFSGLLV VAVLTVYFLA DMPRVRTTLY RFMPAPRRPR
AILLGDQIMV KVGGYVLGNV VISVISAVVT FVWLIAFGVP YPLLLAILFA LLDLIPVIGS
LIAGALVALA AFSVSVPVGL ATIGYFVAYK LVEDYLLTPK VFGRVLRMPA LVTVCAILIG
GALLGLVGAL VALPTAAAIM LLVQEVVFPR LDRAHASGEP AADPAA