Gene Arth_3766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3766 
Symbol 
ID4447851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4241676 
End bp4242761 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content68% 
IMG OID639691590 
ProductLacI family transcription regulator 
Protein accessionYP_833241 
Protein GI116672308 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGA CGGCGGGCCG CGCACACCCT GACGCCCCGG AGCGACCCAA GCTGGAGGAC 
CTCGCCCGGA AGGTGGGAGT CAGCATCGCC ACGGTGTCCC GCGTGGTCAA TGGACGGAAA
GGCGTTTCGC GCGAGGTGCG GCAGTCGGTC CTCGCCGCGA TGGACGACCT TGGCTACGAG
CGGCCGGACC GTGCACGAAG CACCACCCGG GGCCAGGTGG GGATCATTGT TCCGGACCTG
ACCAATCCGA TCTTTCCGGC AATTGCGCAG ACTGTCGTAT CGCTCCTGTC CCAGGAGGAC
TTCATCCCGA TCCTCTGTGC CCTGCCGGGC GGGGGGCGCT CAGAAGACGA GTACATCGAG
ATGCTCGTGG CGCAGGAAGC GTCCGGAATC ATTTTCATCT GCAGTTCACA CGCCGACGGC
CAGGCCAGCC TGGAGCGTTA CCACCGGCTC CGCGGCCGCG GCATCCCGTT CGTCCTGGTC
AACGGTGCAC GTCCGGAACT GTCGGCCGCC TCCGTGTCCA ATGACGACGC CGCGGCAATC
AGCACGGCGG TGCACCACCT GGCCAGCCTG GGGCACCGGA AGGTGGGGCT GGCTATAGGC
CCGCACCGTT TCATCCCCAG CAGGCAAAAG CTGGCCGGAT TCCGCTCCGC CCTCGCCGAG
TACCTGGACA CCCAGGACCC GGAACCGCAC ACGGCTACCA GCATGTTCAC GGTGGAAGGC
GGGCAGAGCG CGGCCAATGA GCTCCTGGAC TCCGGCCACA CGGCCATAGT GTGCGCCTCC
GACGTCATGG CACTCGGCGC CATCCGCGCC GTCCAAGCCA GGGGGCTGCG CGTCCCGGAG
GATGTGTCCA TCGTCGGTTT CGACGACTCC CCGCTGATGG CGCTCACCAA TCCGCCGCTG
ACCACCCTCA GGCAGCCTGT CGCCGCGATC GCGCACGCCG CCGTCCATGC CCTGGCGGCC
GAAATTGCCG GCGAACAGTC CACCCGTTCG CCGGTGGTCC TGGCGTCCGA CCTGGTGGTG
CGTGGATCTA CCGGTCCTGC GGCAGCAGCT TCAGGCCCGC CGCGCAGCCC ACGATCCCGG
CGATGA
 
Protein sequence
MSTTAGRAHP DAPERPKLED LARKVGVSIA TVSRVVNGRK GVSREVRQSV LAAMDDLGYE 
RPDRARSTTR GQVGIIVPDL TNPIFPAIAQ TVVSLLSQED FIPILCALPG GGRSEDEYIE
MLVAQEASGI IFICSSHADG QASLERYHRL RGRGIPFVLV NGARPELSAA SVSNDDAAAI
STAVHHLASL GHRKVGLAIG PHRFIPSRQK LAGFRSALAE YLDTQDPEPH TATSMFTVEG
GQSAANELLD SGHTAIVCAS DVMALGAIRA VQARGLRVPE DVSIVGFDDS PLMALTNPPL
TTLRQPVAAI AHAAVHALAA EIAGEQSTRS PVVLASDLVV RGSTGPAAAA SGPPRSPRSR
R