Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0620 |
Symbol | |
ID | 4446901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 666045 |
End bp | 667535 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639688418 |
Product | GntR family transcriptional regulator |
Protein accession | YP_830119 |
Protein GI | 116669186 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGAT CCCTCAATCC CACTGCCTTG GTGCGCCTCC TGGGCGGGTG GCACCGCGGC GCGGGGCCCG CCTACCGCGA GCTGGCCGAC GTCGTACGCC TCCTCATCCT GGACGGGCGC GTCCCCCTGG ACATGGCACT GCCCAGCGAG CGCGCCCTGG CCGAAACCCT GGGTGTCAGC CGGACAACGG TGACGGCCGC CTATGCCAGC CTCCGCGAGC AGGGGTTCCT GAGCAGCCGG CAGGGCAGTC GCGGCAGGAC CTGCATTCCC CGCCAAATGC CGGCGGGTGC AGGGGGTCCG GCGGCCGCTG AACTGATCAG CCCTAACGCG CTGGCCGGCG CACCGGGACT GGCAGCGCCG CCGGGCCTGC TCGACCTCGC CTACGCGTCC TTGCCGGCCA GCGGCGAAGT GGTGCACCGG GCCTTTGCCG CCGCACTGAC GGAGCTCCCT GCACTTCTTC CGGGCTTCGG CTATGACGCC GTCGGCATCG CACCGCTGAG GGAGGCCATC GCCGCGCGGT ACACGGCGGC CGGAGCCCCC ACCACGGCGG AGCAGATCCT GGTGACATCA GGCGCGCAGC ACGCACTGAA CATCGTGCTC CGCACCCTGG CCGGCCGGCA GGACAAAGTA CTCGTGGACC ATCCCACGTA CCCGCACGCG CTTGATGCCA TCCGCGCCAG CGGCTGCCGG CCGGTGCCGG TCGCACTCCC GCACGGGCGC GGCTGGGACG TGGCCGGAAT GGAATCCGCC ATGATGCAGC AGCGGCCGAA GATGGCTTAC GTGGTGCCCG ACTTCCACAA CCCGACGGGC CGGCTGATGA CCGATCCCCA GCGCCGCCGC CTTGTCCGGG CGGCGGCGGC CGCGGGAACA GTGCTGGTGG TGGACGAGAC ACTTCGTGAA TTGAACCTCG ACGCCGTGGG CGCGGCCCCG CTGGCGGCCT TCAGTCCGGC GGTGGTCACC ATCGGCTCGC TGAGCAAGTC GCATTGGGCC GGCCTGAGGA CCGGCTGGAT AAGGGCCGGC AGTTCACTGA TTCAGCGCTT CGCAGCTGCC CGGACCACCA TGGACCTGGG CGGACCGGTG GTGGAACAGC TGGCGGCGGC ACACCTGGTC CGCTCGCTCG ACGAGCCGCT TCCTGCCCGG CTGGCGGCCC TCCGCGAGAA TCGGGCAGCG CTGCTTGAGC TGCTCGGCGG GATCCTCCCC GACTGGGAGC CGGAGCGGCC CGACGGCGGG CTGAGCGTCT GGTGCCGGCT ACCTGCACCG ATCAGCACCG CGCTGACGGT CCTCGGCCCC GACTTCGGCG TGAGGCTGGC AGCCGGCCCA AGGTTCGGCC TGGGCGGGGC TTTCGAGCAC TACCTGCGAA TCCCCTTCAC GCTTCCGCCC GGGCAATTGG AGACGGCGGT GCGTGCCCTG CGGTCAGCCC AGGACAAGCT CGACTCCGCA CCCCGTCTCC GCCGCAGCCT CCGCCGCACG CCCGCCGTCG CCATCGCCTG A
|
Protein sequence | MSGSLNPTAL VRLLGGWHRG AGPAYRELAD VVRLLILDGR VPLDMALPSE RALAETLGVS RTTVTAAYAS LREQGFLSSR QGSRGRTCIP RQMPAGAGGP AAAELISPNA LAGAPGLAAP PGLLDLAYAS LPASGEVVHR AFAAALTELP ALLPGFGYDA VGIAPLREAI AARYTAAGAP TTAEQILVTS GAQHALNIVL RTLAGRQDKV LVDHPTYPHA LDAIRASGCR PVPVALPHGR GWDVAGMESA MMQQRPKMAY VVPDFHNPTG RLMTDPQRRR LVRAAAAAGT VLVVDETLRE LNLDAVGAAP LAAFSPAVVT IGSLSKSHWA GLRTGWIRAG SSLIQRFAAA RTTMDLGGPV VEQLAAAHLV RSLDEPLPAR LAALRENRAA LLELLGGILP DWEPERPDGG LSVWCRLPAP ISTALTVLGP DFGVRLAAGP RFGLGGAFEH YLRIPFTLPP GQLETAVRAL RSAQDKLDSA PRLRRSLRRT PAVAIA
|
| |