Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0018 |
Symbol | |
ID | 4598371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 19941 |
End bp | 20861 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639774632 |
Product | rhomboid family protein |
Protein accession | YP_921255 |
Protein GI | 119714290 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.417524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCCGAAC ACGCGACGCC CCCGGCCGGG GTGCCGACCT GCTATCGCCA CCCCGGCCGC GAGGCGCACA TCCGCTGCCA GCGCTGCGAC CGGCCGATCT GCCCGGACTG CATGCGCGAC GCGGCCGTGG GGTTCCAGTG CCCCGAGTGC GTGGCCGAAG GCCGCAAGAG CACCCGTGCC GGCCGCACCG CCTTCGGCGG GCTGCGGCCG GGCAACGCCG GGACGACGTC GTTCGTCCTG ATCGGCATCA ACGCCTTCGT CTGGCTGATG ATCCTGGCCA GCGGGGGCAG CTCGAGCCGG GTGCTGGCCT GGCTGGAGCT GCGGCCGAAC GGGCTGTGCC TCAACGCCGC CGGCGGCTTC GACACCACCC GGGCCGTGTG CTCGGGCCGG GGCGAGTGGC TGCCCGGTGT GTACGACGGT GCCTACTGGC AGCTGCTCAC CAGCACGTTC ACGCACGTCC AGCCGTTGCA CATCGCGTTC AACATGTTCG CGCTCTACGT GCTCGGCCCC CAGCTGGAGC TGGCCATCGG CCGGATCCGC TTCCTGGCGC TGTACCTGCT CTCCGGGCTC ACCGGGTCCG CGCTGGTCTA CTGGGCGTCC CCGGAGTTCC AGGCCACCGT CGGCGCCTCC GGCGCGATCT TCGGCCTGAT GGGCGCGCTG CTCGTGGTGG CCTACAAGAT GCGGGCGAAC ACCCAGCAGA TCCTGATGTG GATCGGCATC AACTTCGTCT TCACGGTGGT GGTCAGCAAC ATCTCCTGGC AAGGCCACCT GGGCGGGTTC CTCGGCGGCC TGGTGATCGC CGCGATCCTC GTCTACGCCC CGCGCGGGCC GAAGCGCCCC TGGTTCCAGG TCTCCGGGCT GGTGCTGGTC GCCGCCCTCA CCGCCGTCGC GGTCGTCCTC CGCACCGCGG CCCTCGGCTG A
|
Protein sequence | MSEHATPPAG VPTCYRHPGR EAHIRCQRCD RPICPDCMRD AAVGFQCPEC VAEGRKSTRA GRTAFGGLRP GNAGTTSFVL IGINAFVWLM ILASGGSSSR VLAWLELRPN GLCLNAAGGF DTTRAVCSGR GEWLPGVYDG AYWQLLTSTF THVQPLHIAF NMFALYVLGP QLELAIGRIR FLALYLLSGL TGSALVYWAS PEFQATVGAS GAIFGLMGAL LVVAYKMRAN TQQILMWIGI NFVFTVVVSN ISWQGHLGGF LGGLVIAAIL VYAPRGPKRP WFQVSGLVLV AALTAVAVVL RTAALG
|
| |