Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4383 |
Symbol | |
ID | 4596901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4635033 |
End bp | 4636031 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639778993 |
Product | AraC family transcriptional regulator |
Protein accession | YP_925567 |
Protein GI | 119718602 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.433153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAACC CGCTCGCCGA CGCGTTGGCC AAGCTCCGTC TCGAGGGCGC GATCTTCCTG CGCGGCTCCT ACAGCGAGGC GTGGGCCTAC GAGTCCGTCC CGGGGGCCGA CCTTGCCGCG CTGCTCGTGC CCGGGGCGGA GCGGGTCGTC CTCATGCACG TGATCGGCCG GGGGCGCTGC TGGATCCAGG TGGGCGACGG GGACCGGCAC TGGGCCGACG CCGGTGACGT GGTCGTGCTG CCCTACGGCG ACACCCATCG GATGGGCGGG GTCGAGTCGG TCGAGCCGGT CCAGGTCGGG ACCCTCGTCC AGCCGCCACC GTGGACCCGG ATGCCCGCCA TCGAGCACGG CGGTGGCGGC GACCCGACCC AGGTGGTGTG CGGCTATCTC GCGAGCGAGG ACCCGCTGTT CGACCCGCGG CTCTCGGCGC TGCCGCCGGT GTTCGTCGTC AGTCCGGTCG GCGAGGCCCA GGAGTTCGTG CGGGCCAGCA TCGCCTACGC CCTGCAGCAG ACCGCCCAGG TCGCGGACGG CCGGTTCGAG GTGCCGCCCC GGCTTCCCGA GCTGCTGCTG GTCGAGGTGC TCCGGCTGCA CCTCGCGAAC GCGCCCGCGG CGCACCACGG CTGGCTCGGT GCGCTCCACC ACCCGGTGCT GGCGCCGGCC ATGGCCGCGA TGCACGCCGA CCCGGCAGCG CACTGGAGCG TGGCCGAGCT GGCGCGGGTC GCCGCGGTCT CGGAGTCCCT CCTCGACGAG CGGTTCCGCA CGGTCCTCGG GATGCCCCCG ATCCGCTACC TGGCCGGCTG GCGGATGCAC CTGGCCCGGG ATCTGCTCGA GTCCAGCGAG CTCGGCGTGG CGGTGATCGC CCACCGGGTC GGCTACGAGT CGGAGGAGGC GTTCAGCCGG GCGTTCAAGC GGGCCCACGG CCGCTCTCCG CGGCAGTGGC GGCAGACCTC GGGCCGGGTG CCTCAGACGC CTCGCGAGCC CGGGTCGGAC CACAGATAA
|
Protein sequence | MTNPLADALA KLRLEGAIFL RGSYSEAWAY ESVPGADLAA LLVPGAERVV LMHVIGRGRC WIQVGDGDRH WADAGDVVVL PYGDTHRMGG VESVEPVQVG TLVQPPPWTR MPAIEHGGGG DPTQVVCGYL ASEDPLFDPR LSALPPVFVV SPVGEAQEFV RASIAYALQQ TAQVADGRFE VPPRLPELLL VEVLRLHLAN APAAHHGWLG ALHHPVLAPA MAAMHADPAA HWSVAELARV AAVSESLLDE RFRTVLGMPP IRYLAGWRMH LARDLLESSE LGVAVIAHRV GYESEEAFSR AFKRAHGRSP RQWRQTSGRV PQTPREPGSD HR
|
| |