Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2651 |
Symbol | |
ID | 4599930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2820713 |
End bp | 2821705 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639777257 |
Product | hypothetical protein |
Protein accession | YP_923841 |
Protein GI | 119716876 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.514708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGTCT CGTCGGCCAG CGGCGCCAAG GACCAGGTCG CGCGGCTGCT CACCCTGGTC CCGTTCCTCC ACTCCCACGA CCAGGTCCGT CTCGACGAGG CCGCGGCGGC GCTGGGCGTC CCGCCCGACC AGCTCCTCGG CGACCTGAAG GTGCTGCTCA TGTGCGGGCT GCCGGGGGGC TACCCGGACG ACCTGATCGA CGTCGACCTG GACGCCCTCG AGGGCCCCGA GGCCGACGGC GTGATCCGGG TGTCGAACGC CGACTACCTC TCCCGCCCGC TGCGGCTGAG CCCCACCGAG GCGACCGCGA TCATCGTCGC CCTGCGCGCA CTGCGCTCCG GCGCCGGCGA GGAGACCCGC GAGGTCGTCG ACCGGGCGCT CGGCAAGCTC GAGGCGGCCG CCGCGGAAGG GTCGGCGGCC CGCCGGATCG ACCCCGGCGA GGTCGACCTC CAGCGCCTGG ACCAGGTCCG GCTGGAGGCG GACCTGCAGG ACGCGGTCGC CCGGGCGCGC CAGGTCCGGT TCAGCTACTT CGTCCCCTCC CGCGACCAGC AGTCCGAGCG CGTCGTCGAC CCGCGGGGCA TCGTCTCGGC GCACGGGTTC ACCTACCTGG ACGCCTGGTG CCACAGCGCC CGGGCACCGC GGCTGTTCCG CCTCGACCGG ATCCGCGACG CCGAGCTGCT CGACGCGCCC GTGAGCACCG CCCCGGAGGC GCCCCGTGAC CTCTCCGAGG GCCTGTTCAA GAGGTCCGAG GACACCACCC TGGTCACCCT CCGCGTGGAC CGCCCGGCCC GCTGGATCGT GGACTACTAC CCGGTGGAGG CGGTGCGGCC GCGCCGCGAC GGAGCGCTCG ACGTGGACCT GCTCGTCGCC GACCAGCGCT GGCTCCAGCG ACTGCTGCTC CGGCTGGCCC CCCATGCCCG GGTCGTCAGT CCCGCGGACC TCGACCAAGC GTTCACCCGC CGGGCACACG AGACGCTCAG CCTTTACGGC TGA
|
Protein sequence | MSVSSASGAK DQVARLLTLV PFLHSHDQVR LDEAAAALGV PPDQLLGDLK VLLMCGLPGG YPDDLIDVDL DALEGPEADG VIRVSNADYL SRPLRLSPTE ATAIIVALRA LRSGAGEETR EVVDRALGKL EAAAAEGSAA RRIDPGEVDL QRLDQVRLEA DLQDAVARAR QVRFSYFVPS RDQQSERVVD PRGIVSAHGF TYLDAWCHSA RAPRLFRLDR IRDAELLDAP VSTAPEAPRD LSEGLFKRSE DTTLVTLRVD RPARWIVDYY PVEAVRPRRD GALDVDLLVA DQRWLQRLLL RLAPHARVVS PADLDQAFTR RAHETLSLYG
|
| |