Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3861 |
Symbol | |
ID | 8449480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4232340 |
End bp | 4233722 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645042909 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003203145 |
Protein GI | 258653989 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00340007 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTGGC CGCTGTCGCT GCTGGTCGGC GTGCTCGTCG TGCTGGCCAT CACCGCGGCC ACCGCCTACT TCGTGGCCCA GGAATTCGCC TACATGTCGG TGGACCGCTC CCGGCTCAAG GCGGCCGCGG CGTCCGGGGA CGCCGGCGCC GAGCGCGCGC TGGCCGTGAT CCGGCGGACC TCGTTCATGC TCTCCGGGGC CCAACTCGGC ATCACGGTGA CCGGTCTGCT GGTCGGCTAC GTCGCCGAAC CGCTGATCGG CGCGGCGATC GGCGCGGCGC TGGGCGGGAT CTCGGTGCCG GCCGCCGTCG GCATCACCAT CGGCACCCTG CTGGCCCTGA CCTTTTCCAC CTTCGTGCAG ATGCTGCTGG GCGAGTTGTT TCCCAAGAAC CTGGCCATCG CCCGGCCGGA GCCGGTCGCC GTCCGGCTGG CCCGCTCCAC CACGCTCTAC CTGACCGTCT TCGGGTGGCT CATCGCCATC TTCGACAAGT CCTCCAACCT GCTGCTGCGG CTGCTGCGGA TCGAACCGGT GCACGACGTG GAGCATTCGG CGAGCCTGCG CGACCTGGCC CACATCGTCA CCGCCTCCCG GGACAGCGGA GACCTCTCGC CCGAGCTGTC GCTGCTGCTG GACCGGGTGA TCGACTTCCC GAACCGGACG GTCGCGCACG CGATGATCCC GCGGGCCCGG GTCGGCGTGC TGCATCCCGG GATCGACCTG GCCGGGATCC GAGAGGTGAT GCACACCGGC CATTCCCGGT ACCCGGTGCT GGACGACGCC GACGAGGTGC TCGGGGTGGT GCACCTGATC GACGTGCTCG AGCACACCGA CGACACGCTC ACCATCGAAC GCCTGATGCG GCCGCCCCAC TTCGTGCCGA CCGGTCAGAG CCTGCCGCGG GCCCTGCGGG AGCTGACCGC CGGCACCGAT CAGCTGGCCT GCGTGCTGGA CGAGTACGGC GGTTTCGTCG GGGTGCTGAC CGCCGAGGAC CTGGCCGAGG AGCTGGTCGG CGAGATCGCC GACGAGCACG ACCCCGTCGG CCCGCCGGTG CTGCCGCCGC CGTCGCCGAG CACCGGCTGG CAGCTGCCCG GGGACCTGCC GGTGGACGAG GCGGCCCGGT TGATCGGCCG GCCGCTGCTG ACCGGAGACT ACGAGACGCT GTCCGGGCTG GTCATCGCTC GGTGCGGCGC GTTCCCGGCT GTCGGGCAGG TGGTGGCTGT CCCGTTGCCG CCCGACCCGG CCCGATTGAC CCTGGCCGAG CCGGCCGGCG AACCGGTCCT GCGGCTGCGC GTGGACGAGC TGGCCCGGCA CGTGCCGGCC CGGATCACGC TGACCATCCA CGATCCCGAC GCGATCGCCT CGGCTGCGGA AGGGACGTCA TGA
|
Protein sequence | MIWPLSLLVG VLVVLAITAA TAYFVAQEFA YMSVDRSRLK AAAASGDAGA ERALAVIRRT SFMLSGAQLG ITVTGLLVGY VAEPLIGAAI GAALGGISVP AAVGITIGTL LALTFSTFVQ MLLGELFPKN LAIARPEPVA VRLARSTTLY LTVFGWLIAI FDKSSNLLLR LLRIEPVHDV EHSASLRDLA HIVTASRDSG DLSPELSLLL DRVIDFPNRT VAHAMIPRAR VGVLHPGIDL AGIREVMHTG HSRYPVLDDA DEVLGVVHLI DVLEHTDDTL TIERLMRPPH FVPTGQSLPR ALRELTAGTD QLACVLDEYG GFVGVLTAED LAEELVGEIA DEHDPVGPPV LPPPSPSTGW QLPGDLPVDE AARLIGRPLL TGDYETLSGL VIARCGAFPA VGQVVAVPLP PDPARLTLAE PAGEPVLRLR VDELARHVPA RITLTIHDPD AIASAAEGTS
|
| |