Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0707 |
Symbol | |
ID | 4599811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 745466 |
End bp | 746884 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639775308 |
Product | virulence factor Mce family protein |
Protein accession | YP_921919 |
Protein GI | 119714954 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.953173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGGC CGCGGATGCT GCGCGCCGTC GTCGCCGCTG CGGCCGGAGC GGTGCTGCTG ACCGGGTGCC AGTTCGACGT CTACGCATTG CCGCTGCCCG GTGGCACCGA CGTCGGCGAC AACCCGATCA CCGTCACCGC CGACTTCCGC GATGTGCTCG ACCTGGTCCC GAAGTCCACG GTCAAGGTCG ACGACGTGAA CGTGGGCGAG GTGACCGACG TCGACCTCGT CGAGGGCCAT GCCCAGGTGA CGATGAAGCT CCGCAACGAC GTCGACCTGC CCGACAACGC GATCGCCGAG ATCCGGCAGA CCAGCCTGCT CGGCGAGAAG TTCGTCTCGC TCAGCCCGCC GACCGACACC CCGGCCACAG GTGAGCTGGG CGACGGCGAC ACCATCGGTC TGGAGGACAC CGGCCGCAAC CCGGAGGTCG AGGAGGTCCT CGGTGCGCTC AGCCTGGTCC TCAACGGCGG TGGTGTCGCC CAGCTGAAGA CCATCGCCAC CGAGCTCAAC AAGGCGCTCG GCGGACGCGA GGACGCCGCC CGGTCGGTCC TGACCCAGAT CAGCGACTTC ATGGCCCAGC TCGACGAGAA CAAGCAGGAC ATCGTCAATG CCATCGACTC GCTGAACAAC CTGGCGGTCT CGGCCCGCGG GCAGCAGGAC TCGATCGACG CCGCGCTCGA CGAGCTGCCG AGTGCGTTGA CCTCCCTGGA CCAGCAGCGC GCCGACCTCG TCAAGATGCT GGGCGCCCTG AACCGGCTCG GTGGCGTGGG CGTCCGGGTC ATCAACGCCT CGAAGGACGC CACGATCGAG TCGTTGCGCC AGCTGAACCC GGTGCTCACC GAGCTGGCGA ACTCCGGCGA CGCGTTCGTG AAGTCGTTCA ACGTCTTCTT GACCTACCCG TTCGTCGACG ACGTCGTCGG CCGCGACCCG CAGGTCGCGC GCAACCTGCA CATGGGCGAC TACACCAACC TGTCGGTCGA GCTCGACATG GACCTCTCCG GCGGAGCCAG CGGCACCGCC CTGCCGACCC TGCTGCCCAG CGACCTCGAT CCGACGGTCG TCCTCGGCCG GGTCGCCCGG TGCATCCAGA GCGGGGACAT CAACAGCTTG CCGTGCCAGA GGCTGCTGAG CACCGTCGAG GGTCTGGCGA AGCTGCGCGA CGCGTGCCTG AAGAAGAAGA ACGAGAAGAC CGTCGTCTGC CGGATCGTCA ACCAGATCCC GGGGCTGCCC GAGGGCGGCC TCCCGACCGC CTTGCCGACC AGCCTCCCGA GCCTCCCCGA CCTGACCGGG ATCCTCGGCC TGGGCCGGAC CGGCGTCGGC CCGACGACCT CGGCGCCCGG CCCGACCCTG GGCCAGCTCA GCCGCCTGTT CGACCCGGCC CTCGTCCGCC TCCTCGTCCC GGGGATGGTG ACCCGATGA
|
Protein sequence | MMRPRMLRAV VAAAAGAVLL TGCQFDVYAL PLPGGTDVGD NPITVTADFR DVLDLVPKST VKVDDVNVGE VTDVDLVEGH AQVTMKLRND VDLPDNAIAE IRQTSLLGEK FVSLSPPTDT PATGELGDGD TIGLEDTGRN PEVEEVLGAL SLVLNGGGVA QLKTIATELN KALGGREDAA RSVLTQISDF MAQLDENKQD IVNAIDSLNN LAVSARGQQD SIDAALDELP SALTSLDQQR ADLVKMLGAL NRLGGVGVRV INASKDATIE SLRQLNPVLT ELANSGDAFV KSFNVFLTYP FVDDVVGRDP QVARNLHMGD YTNLSVELDM DLSGGASGTA LPTLLPSDLD PTVVLGRVAR CIQSGDINSL PCQRLLSTVE GLAKLRDACL KKKNEKTVVC RIVNQIPGLP EGGLPTALPT SLPSLPDLTG ILGLGRTGVG PTTSAPGPTL GQLSRLFDPA LVRLLVPGMV TR
|
| |