Gene Arth_0620 details

Gene Information

Locus tag: Arth_0620
Symbol:
ID: 4446901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 666045
End bp: 667535
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 72%
IMG OID: 639688418
Product: GntR family transcriptional regulator
Protein accession: YP_830119
Protein GI: 116669186
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
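
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that contributes no residue. Both are assumptions rather than statements made by the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 666045
end_bp = 667535
gene_length_bp = 1491
protein_length_aa = 496

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks print True for the values listed above.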


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGGAT CCCTCAATCC CACTGCCTTG GTGCGCCTCC TGGGCGGGTG GCACCGCGGC 
GCGGGGCCCG CCTACCGCGA GCTGGCCGAC GTCGTACGCC TCCTCATCCT GGACGGGCGC
GTCCCCCTGG ACATGGCACT GCCCAGCGAG CGCGCCCTGG CCGAAACCCT GGGTGTCAGC
CGGACAACGG TGACGGCCGC CTATGCCAGC CTCCGCGAGC AGGGGTTCCT GAGCAGCCGG
CAGGGCAGTC GCGGCAGGAC CTGCATTCCC CGCCAAATGC CGGCGGGTGC AGGGGGTCCG
GCGGCCGCTG AACTGATCAG CCCTAACGCG CTGGCCGGCG CACCGGGACT GGCAGCGCCG
CCGGGCCTGC TCGACCTCGC CTACGCGTCC TTGCCGGCCA GCGGCGAAGT GGTGCACCGG
GCCTTTGCCG CCGCACTGAC GGAGCTCCCT GCACTTCTTC CGGGCTTCGG CTATGACGCC
GTCGGCATCG CACCGCTGAG GGAGGCCATC GCCGCGCGGT ACACGGCGGC CGGAGCCCCC
ACCACGGCGG AGCAGATCCT GGTGACATCA GGCGCGCAGC ACGCACTGAA CATCGTGCTC
CGCACCCTGG CCGGCCGGCA GGACAAAGTA CTCGTGGACC ATCCCACGTA CCCGCACGCG
CTTGATGCCA TCCGCGCCAG CGGCTGCCGG CCGGTGCCGG TCGCACTCCC GCACGGGCGC
GGCTGGGACG TGGCCGGAAT GGAATCCGCC ATGATGCAGC AGCGGCCGAA GATGGCTTAC
GTGGTGCCCG ACTTCCACAA CCCGACGGGC CGGCTGATGA CCGATCCCCA GCGCCGCCGC
CTTGTCCGGG CGGCGGCGGC CGCGGGAACA GTGCTGGTGG TGGACGAGAC ACTTCGTGAA
TTGAACCTCG ACGCCGTGGG CGCGGCCCCG CTGGCGGCCT TCAGTCCGGC GGTGGTCACC
ATCGGCTCGC TGAGCAAGTC GCATTGGGCC GGCCTGAGGA CCGGCTGGAT AAGGGCCGGC
AGTTCACTGA TTCAGCGCTT CGCAGCTGCC CGGACCACCA TGGACCTGGG CGGACCGGTG
GTGGAACAGC TGGCGGCGGC ACACCTGGTC CGCTCGCTCG ACGAGCCGCT TCCTGCCCGG
CTGGCGGCCC TCCGCGAGAA TCGGGCAGCG CTGCTTGAGC TGCTCGGCGG GATCCTCCCC
GACTGGGAGC CGGAGCGGCC CGACGGCGGG CTGAGCGTCT GGTGCCGGCT ACCTGCACCG
ATCAGCACCG CGCTGACGGT CCTCGGCCCC GACTTCGGCG TGAGGCTGGC AGCCGGCCCA
AGGTTCGGCC TGGGCGGGGC TTTCGAGCAC TACCTGCGAA TCCCCTTCAC GCTTCCGCCC
GGGCAATTGG AGACGGCGGT GCGTGCCCTG CGGTCAGCCC AGGACAAGCT CGACTCCGCA
CCCCGTCTCC GCCGCAGCCT CCGCCGCACG CCCGCCGTCG CCATCGCCTG A
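
The gene sequence above is laid out in blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and a GC fraction computed, for comparison with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first sequence line is used as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGTCCGGAT CCCTCAATCC CACTGCCTTG GTGCGCCTCC TGGGCGGGTG GCACCGCGGC"
seq = clean_sequence(first_line)
print(len(seq), "bases, GC fraction", round(gc_fraction(seq), 2))
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` treats spaces and newlines alike.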
 
Protein sequence
MSGSLNPTAL VRLLGGWHRG AGPAYRELAD VVRLLILDGR VPLDMALPSE RALAETLGVS 
RTTVTAAYAS LREQGFLSSR QGSRGRTCIP RQMPAGAGGP AAAELISPNA LAGAPGLAAP
PGLLDLAYAS LPASGEVVHR AFAAALTELP ALLPGFGYDA VGIAPLREAI AARYTAAGAP
TTAEQILVTS GAQHALNIVL RTLAGRQDKV LVDHPTYPHA LDAIRASGCR PVPVALPHGR
GWDVAGMESA MMQQRPKMAY VVPDFHNPTG RLMTDPQRRR LVRAAAAAGT VLVVDETLRE
LNLDAVGAAP LAAFSPAVVT IGSLSKSHWA GLRTGWIRAG SSLIQRFAAA RTTMDLGGPV
VEQLAAAHLV RSLDEPLPAR LAALRENRAA LLELLGGILP DWEPERPDGG LSVWCRLPAP
ISTALTVLGP DFGVRLAAGP RFGLGGAFEH YLRIPFTLPP GQLETAVRAL RSAQDKLDSA
PRLRRSLRRT PAVAIA
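
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), so the standard assignments are used here. The function name `translate` and the constants are illustrative, and translation is assumed to begin at the first base and stop at the first stop codon.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence from its first base, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCCGGAT CCCTCAATCC CACTGCCTTG"))  # MSGSLNPTAL
```

The output matches the first 10 residues of the protein sequence listed above. A library such as Biopython could be used instead of the hand-built table.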