Gene Namu_1059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1059 
Symbol 
ID8446655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1170269 
End bp1171834 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content69% 
IMG OID645040197 
Producttail sheath protein 
Protein accessionYP_003200456 
Protein GI258651300 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACCT ACGCCGCGCC CGGTGTGTAC GTCGAAGAAG TGGCGTCGAC CCAAAAGGTG 
CTGGCCGCCG CCCCGACGGC GGTGACCGCC TTCGTCGGCT TCACCGAGCG CTTCCCGACC
GACGACCCGG GCGATCCGGA GGGGCTGGCC CCCCGCCTGG TCACCAGCTG GTCGCAATTC
GAGGCCAGCT ACGGCGGGTT CACCCCCGGG GCCGTGCTGC CGCTGTCGGT GTACGGCTAC
TTCGCCAATG GCGGGGCGCT GGCCTACATC GTCCGGGTGC CCAACACCGC ACCGTCGGGC
GAGCCCTCCC GCCGGGAGCT GCCCGCCGCG GACCGCGCCC TCGGACTGCC ACTGGCCGTC
GAGAGTCTCG AGCCGGACGC CGATCTCACC CTCCGCGTGA CCACCCAGGA CACCGACGAG
GACGGCCCCA GCCCGTTCAC CCTGGACGTC CTGCAGGGGC TGGAGGTCGT CGAATCCTTC
CCCGACCTGA CCCTGGGCAG CGGCAAGCGC AACGTCGCGA CCGTGGTCAA CGACACCTCG
ACCAAGATCA AGGTGGAGGT GCTGCTGGAG TCCAAGACCG ACCTGTCCGG TCAGCTCGAG
CTGCTCAAAC CGGGCCTGTA CCCGCTGGAA AAGGCGGCCC CGTCGGCGGT TCCGGTGACC
GGGCGACGGT TCGCCGGCTC CGAGTCCAGC CGCCAGGGCA TCAACGGGCT GGCCGTGGCC
GACGACGTGA CGATCGTCGT GGTGCCCGAC CTGATCACCG CGGCGACCAA GGACGACGGC
ACCGTCGATC TGAACCTGTG GAAGGCCGTG CAGACGGCGC TGATCAGCCA CTGCGAGCAG
AACGGCAACC GGATGGCCGT GCTGGACGCG CCGCCCGGCA TGACGCCGCA GCAGATCCGC
GACTGGCGCA GCGACGTCGC CATGTACGAC TCCCCCTACG CGGCGCTGTA CTACCCGTGG
ATCAAGGTGG AGAACCCGAT CGGGGTCAAC GGCGACGCCG AGGTGTTCAT CCCGCCCAGC
GGGCACATCG CCGGCGTGTG GGCCCGCACC GACGAGACCC GCGGGGTGTG GAAGGCGCCG
GCGAACGACA CCATCCGCGG CTGCCTGGAT GTCGCCTACG GCGTCACCCA GAACGAGCAG
GCCGTGCTCA ACCCGATCGG CATCAACTGC ATCCGCCCGT TCGGCACCCG CGGCATCCGC
ATCTGGGGGG CGCGGACCCT GGCCAGCGAC TCGGACTGGC GCTACATCAA CGTCCGCCGG
CTGTTCAACA TGGTCGAGAA GACCATCGCC GACGGCACCC AGTGGGCGGT ATTCGAGCCC
AACGACGTGT CCCTGTGGGA GGGCATCAAG CGCACCCTCA ATGCGTTCCT GCGCGGGTTG
TGGAGCGCCG GTGCCCTGTT CGGCCAGTCC GTCGACCAGG CCTTCTACGT CAAGTGCGAC
GCCGAGAACA ACCCGCCGGA ATCGATCGAC CAGGGCCTGC TGATCGTCGA GGTGGGCATC
GCGCCGGTCA AGCCGGCCGA GTTCGTCGTC TTCCGCATCG CCCAGCACAA GCAGGTCGCG
AACTGA
 
Protein sequence
MPTYAAPGVY VEEVASTQKV LAAAPTAVTA FVGFTERFPT DDPGDPEGLA PRLVTSWSQF 
EASYGGFTPG AVLPLSVYGY FANGGALAYI VRVPNTAPSG EPSRRELPAA DRALGLPLAV
ESLEPDADLT LRVTTQDTDE DGPSPFTLDV LQGLEVVESF PDLTLGSGKR NVATVVNDTS
TKIKVEVLLE SKTDLSGQLE LLKPGLYPLE KAAPSAVPVT GRRFAGSESS RQGINGLAVA
DDVTIVVVPD LITAATKDDG TVDLNLWKAV QTALISHCEQ NGNRMAVLDA PPGMTPQQIR
DWRSDVAMYD SPYAALYYPW IKVENPIGVN GDAEVFIPPS GHIAGVWART DETRGVWKAP
ANDTIRGCLD VAYGVTQNEQ AVLNPIGINC IRPFGTRGIR IWGARTLASD SDWRYINVRR
LFNMVEKTIA DGTQWAVFEP NDVSLWEGIK RTLNAFLRGL WSAGALFGQS VDQAFYVKCD
AENNPPESID QGLLIVEVGI APVKPAEFVV FRIAQHKQVA N