Gene Achl_3799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3799 
Symbol 
ID7295287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp4240528 
End bp4241598 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content65% 
IMG OID643592209 
Producttranscriptional regulator, LacI family 
Protein accessionYP_002489841 
Protein GI220914532 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAA GCACCACACC CCAGCAGTCC GCCGCCGTGC GGCAGCGGGG CGTCACCATG 
AATGACGTCG CCAAGCACGC CGGCGTTTCC CGGACAGCCG TTTCGTTCGT CCTGAGCAAC
CGCGAAAACG CCAGCATTTC CGAGGAAACC CGCACCCGGA TCAACGAGGC CGTCCAGGAA
CTGGGTTACC GGCCCAACGC CGGCGCCCGT GCACTGGCAT CCCAGCGCAG CGGCTGGTAC
GGCATTGTCA CCGAGATCGT CACGGCCCCG TTCGCCGTCG ACATCATCAA AGGTGCCCAG
GACCAGGCCT GGCTTGACCG CCGGTTCCTG CTCATCGCGC CCTCCGACCA GGCCGATGCC
GTAGGACCCA ACCAGGGCCT GGAAGATGCT GCGACGGAAA AGCTTCTGGA ACAGCGCGTG
GAAGGACTTC TGTACGCAGC CACCTTCCAC CGGGGCGTCC ACGTTCCGGA GAGCGCCAAT
GAAGTACCCA CTGTCCTGAT CAACTGCTTC GACGCTGACG GAAAGCTGCC CTCGATCGTC
CCTGACGAGC GTGCGGGTGG CCGGGTGGCC GTCGAACGGT TGCTCAAGGC CGGCCACACC
CGGATAGGAG TTATCAACCT TGACCCGGTG ATCCCTGCAG CAGTGGGGCG GTTGGAAGGT
GCGCGCGAAG CACTGGCCAC CGCCGGACTG GACCTGGATC CGGAACTGGT GGTGTCCGGA
TACGCGACGG CCGACGGCGG CTACGAGGCC GCCGGCCGGA TCCTGGACAG GTACCAGGGG
GAGCGCAGGC CAACGGCACT GTTCTGCCTC AACGACCGCA TGGCGATGGG CGCCTATGAC
GCCATCAAGG AACGGGGCCT GACCATCCCC GGAGACATCG CCGTGATCGG CTTCGACAAC
CAGGAACTGA TTGCGGCCTA CCTGCGGCCC AAACTGACCA CCGTTGCGTT GCCGTTCGAA
AAAATGGGAG CCCTGGGAGT CCAGACGCTC GCCGCCCTTA CAGCAGGACA GCCGATCATT
GCCGACCAGC AACTGGTCGA CTGTCCGCTG CTAGAACGCT CTTCGGTCTG A
 
Protein sequence
MAKSTTPQQS AAVRQRGVTM NDVAKHAGVS RTAVSFVLSN RENASISEET RTRINEAVQE 
LGYRPNAGAR ALASQRSGWY GIVTEIVTAP FAVDIIKGAQ DQAWLDRRFL LIAPSDQADA
VGPNQGLEDA ATEKLLEQRV EGLLYAATFH RGVHVPESAN EVPTVLINCF DADGKLPSIV
PDERAGGRVA VERLLKAGHT RIGVINLDPV IPAAVGRLEG AREALATAGL DLDPELVVSG
YATADGGYEA AGRILDRYQG ERRPTALFCL NDRMAMGAYD AIKERGLTIP GDIAVIGFDN
QELIAAYLRP KLTTVALPFE KMGALGVQTL AALTAGQPII ADQQLVDCPL LERSSV