Gene Arth_0397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0397 
Symbol 
ID4447124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp422504 
End bp423619 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content65% 
IMG OID639688196 
ProductLacI family transcription regulator 
Protein accessionYP_829898 
Protein GI116668965 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGA GCACACAGGC AGCGGCGTCG GGTCCGGCCG CACCGGCCCG GTCGGCCCCT 
GCCGTCCCCA AAGCCGCTCC CGCACACCGG GGCGTCACCA TGGCGGACGT GGCCAAGGCC
GCCGGCGTGT CCCGAACCGC GGTTTCCTTC GTCCTGAGCA ACCGCGAGAA CGCCAGCATT
TCGGAAGAGA CCAAGCACCG CATCCTCGAA GCGGTCCAGA CCCTCGGCTA CCGGCCTAAC
GCGGGTGCAC GTGCCCTCGC GTCGCAGCGC AGCGACTGGT ATGGCATCGT CACAGAGATC
GTCACGGCAC CGTTCGCCGT CGACATCATC AAGGGCGCGC AGGACCAGGC CTGGCTGTCC
CGCCGGTTCT TGCTCATCGC GCCCTCCGAC CAGGCCGATG CAACAGGACC AAACCAGGGC
ATGGAAGACG CGGCCGTTGA AAAGCTACTG GAACAAAGAG TGGAAGGACT TCTCTACGCA
GCCACGTACC ACCGGGCCGT GCACGTTCCC AAAAGCGCCA ACGAGGTGCC CACTGTCCTG
ATCAACTGCT TCGACGCGGA CGGGAAGCTG CCCTCGGTCG TCCCTGACGA GCGGGCCGGG
GGCCGCGTCG CCGTCGAGCG TTTGCTGCAA GCGGGCCACA CCAGAATCGG TGTCATCAAC
CTGGATCCGG ACATTCCCGC CGCCGTCGGC CGTTTGGAGG GGTGCCGCGA AGCACTGGCC
GAAGCAGGGC TGGAGCTGGA TCCTGAACTC GTCGTCTCGG GACACGCAAC GGCGGATGGC
GGCTACGAGG CCGCCTGCGA AATTCTTGAT AAATATCAGG CCGGGGCAGG CAGGCCAACT
GCCCTGTTCT GCCTCAACGA CCGGATGGCT ATGGGCGCTT ACGACGCCAT CAAGGAGCGC
GGGCTCGCCA TCCCCCAAGA CATCGCCGTG ATCGGCTTCG ACAACCAGGA ACTCATTGCG
GCCTACCTCA GGCCCAAGCT GACCACGGTT GCGTTGCCCT TCGAGGAGAT GGGTGCGCTG
GGTGTCCAGA CACTCGCAAG CCTTACAGCC GGACAGCCGA TCACTGCACA TCAGCAAATG
GTCGACTGTC CGCTGCTAGA ACGCTATTCA GTCTGA
 
Protein sequence
MAKSTQAAAS GPAAPARSAP AVPKAAPAHR GVTMADVAKA AGVSRTAVSF VLSNRENASI 
SEETKHRILE AVQTLGYRPN AGARALASQR SDWYGIVTEI VTAPFAVDII KGAQDQAWLS
RRFLLIAPSD QADATGPNQG MEDAAVEKLL EQRVEGLLYA ATYHRAVHVP KSANEVPTVL
INCFDADGKL PSVVPDERAG GRVAVERLLQ AGHTRIGVIN LDPDIPAAVG RLEGCREALA
EAGLELDPEL VVSGHATADG GYEAACEILD KYQAGAGRPT ALFCLNDRMA MGAYDAIKER
GLAIPQDIAV IGFDNQELIA AYLRPKLTTV ALPFEEMGAL GVQTLASLTA GQPITAHQQM
VDCPLLERYS V