Gene Namu_1971 details


Gene Information

Locus tag: Namu_1971
Symbol:
ID: 8447580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2175931
End bp: 2177721
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 60%
IMG OID: 645041100
Product: protein of unknown function DUF262
Protein accession: YP_003201346
Protein GI: 258652190
COG category: [S] Function unknown
COG ID: [COG3472] Uncharacterized conserved protein
TIGRFAM ID:
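
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are inclusive at both ends and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2175931
END_BP = 2177721
REPORTED_GENE_LENGTH = 1791    # bp
REPORTED_PROTEIN_LENGTH = 596  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1791 596
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.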


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0185458
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00310192
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGAAGT ACTCGGTGCA GCAGGAGTCT GTCGACGGGC TTCTCACCCT CGTCAAGGGC 
GGCAAGATCG CGATACCGGA ACTGCAACGT CCGTTCGTGT GGAACTCGAC CAAGGTCCGG
GATCTCCTTG ACTCGCTCTA CAAGGGCTAC CCGGTGGGAT ACCTGATCAC CTGGCAGTCG
GTGGGTGCAG CGCTGAAGGA CGGCCAGATT GCGCAGCACC AGCAGATCCT CATCGACGGT
CAGCAACGGG TGACCGCCCT TCGGGCGGCC GTCGCCGGAC TGCCCGTAGT CGACAATCAC
TACAAGAAAC AGAAGATCAC CATCGCGTTC AATCCACTCA CCGGTGACTT CGAGACCGTC
ACCCCGGTGA TCCGCAAGAC TCCGCAGTGG ATCCCCGACG TCTCGGAATT GTTCGGCGCG
ACGTCCACTT TCGCGTTCTT CTCGAAGTAT GTGGCTCGCA ATCCGGATGT CGACGTTGCC
CTCGTGGAGG AGTCGATCGA CCGTTTGCTG ACGATCAGGA GCGCGCAGAT CGGGATCATC
GCGCTGGCCG ACGACCTCGA GGTTGAGACG GTCTCTGAGA TCTTCATCCG GATCAACTCC
AAGGGCGTGC CGCTGTCCAG CGCCGACTTC GCGATGAGCA AGATCGCCAC CTACGGTGAC
CGCGGGCGTA ATCTGCGCAA GCTAATCGAC TACTTCTGCC ACCTGTCAGT CGCTCCGCAC
GTCTACCAGG ACATCGTCGA CAACGACCAC GAGTTCGTGG CCACGCCGTT CCTGCCGAAG
ATCGCCTGGC TCAAGGACGA TGCCGAGGAC CTGTACGACC CGAATTACAG CGACGTCATC
CGGATCGCCA ACCTCATCGG ATTCAACCGC GGCGTCGCAT CAGCATTCGT CAGCGAGTTG
TCCGGGCGTG ACCCGGAGAC CCGAAAGGTC GACGAATCGA GGATTCCGGT CGCTTACGAC
CGACTCGAAG ACGCGCTGCT TCGGATCGTC AACAAGTACG ACTTCCAGAA CTTCATCATG
ACAATCAAGT CGGCCGGCTT CATTACGACG GCGATGATCA GCTCCAAGAA CGCGCTGAAC
TTCGCCTACG CCCTGTACCT ACGTCTGCGG CACGACCCCA CAATGTCCGA GGGCGAGCGG
AAGAGCATCG TCCGAAGGTG GTTCGTCATG TCGATGCTGA CCGGGCGCAA CTCGGGGAGC
GTAGAGACCC AGTGGGAACT TGACGTTCGG CGGATCAGTC AGTACGGCGC GGCGGCCCAC
CTCAAACAGA TCGAAGAGGC GGAGCTGTCG GATGCCTTCT GGCAGGTGAC TCTACCCGGG
AACCTCGAGA CGTCCAGCGT CCGCAGCCCC TACTTCCAAA CGTTCCTCGC CGCTCAGGTG
AAGACCGGAG CGCGCGGCTT CCTGTCGAAA TCGATCACAG TCCAGGCGAT GCACGAGCAG
ATCGGCGACA TCCACCACAT CGTGCCCAAG GACTACCTCA AAAAGGAAGG CGTCACCGAC
CGGTCCGACT ACAACCAGGT AGCGAACTAC GTTCTCACCG AGACATCGAT CAACATCCGC
ATCAGCAACC GCGCCCCCGG CGCCTACATG GACGAGATCC GCACCCAGGT CGACACCGGA
AAGCTCACTC TCGGTGAGAT CACCGATGAT CAGGACCTCC GTCGAAACTT CGCCGAGAAC
GCCGTGCCGC ACGATCTCGA CACCGTCACC GCCGGCTCGT ACTTCGACTT TCTGGTCCGA
CGACGGGTCA TGATGGCACA GACCATCGGT CGGTACTACG ACGGACTATA G
 
Protein sequence
MAKYSVQQES VDGLLTLVKG GKIAIPELQR PFVWNSTKVR DLLDSLYKGY PVGYLITWQS 
VGAALKDGQI AQHQQILIDG QQRVTALRAA VAGLPVVDNH YKKQKITIAF NPLTGDFETV
TPVIRKTPQW IPDVSELFGA TSTFAFFSKY VARNPDVDVA LVEESIDRLL TIRSAQIGII
ALADDLEVET VSEIFIRINS KGVPLSSADF AMSKIATYGD RGRNLRKLID YFCHLSVAPH
VYQDIVDNDH EFVATPFLPK IAWLKDDAED LYDPNYSDVI RIANLIGFNR GVASAFVSEL
SGRDPETRKV DESRIPVAYD RLEDALLRIV NKYDFQNFIM TIKSAGFITT AMISSKNALN
FAYALYLRLR HDPTMSEGER KSIVRRWFVM SMLTGRNSGS VETQWELDVR RISQYGAAAH
LKQIEEAELS DAFWQVTLPG NLETSSVRSP YFQTFLAAQV KTGARGFLSK SITVQAMHEQ
IGDIHHIVPK DYLKKEGVTD RSDYNQVANY VLTETSINIR ISNRAPGAYM DEIRTQVDTG
KLTLGEITDD QDLRRNFAEN AVPHDLDTVT AGSYFDFLVR RRVMMAQTIG RYYDGL
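
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to rejoin such blocks, compute a GC fraction, and translate the DNA to protein. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names are hypothetical, and the example input is only the first line of the gene sequence, not the full record.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (for example GTG) are not recoded to M here;
    this gene starts with ATG, so that case does not arise.
    """
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGGCGAAGT ACTCGGTGCA GCAGGAGTCT GTCGACGGGC TTCTCACCCT CGTCAAGGGC"
dna = join_blocks(first_line)
print(translate(dna))  # prints: MAKYSVQQESVDGLLTLVKG
print(round(gc_fraction(dna), 2))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Running the same functions on the full gene sequence would allow the reported GC content and protein length to be checked.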