Gene Arth_3606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3606 
Symbol 
ID4443917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4049346 
End bp4050413 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content64% 
IMG OID639691430 
ProductLacI family transcription regulator 
Protein accessionYP_833081 
Protein GI116672148 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGGCC AGGAGCCTCA CCGAGGAGGA GAAGTGAAGA AGGCGCCAAC AATTCGCGAT 
GTCGCGTCGG CGGCCGGAGT TTCAGTCTCG GTGGTGTCCC GGGTACTGAA CCCTGACTCA
GGCCCGGTCG CGCCGGCCAA GCGCGAAACG GTGTTGCGGG TCATTGACGA ACTCGGGTAC
CGGCCCCGTG CCGCCGCCCG TGAACTCAGC GTCGGCCATA CCCCCACAAT TGGCCTGGTG
GTGGCAGACC TGGCCAACCC CTTTTTTGCG CAGCTGGCCG ACCGCGTCGT CTGGGAGGCC
CGCAGCCACG GCGTGCAGGT GATGGTGATG ACCACCCAGG AAGACCCCCA CCTTGAAGCC
GACAGCCTGG ACACACTCCT TGACCGCTCA GTCGGAGGAG TCATTGCCAC GCCCACCGGA
GCAAACATTG AGAAGTGGGC ACGCCTCCAG TCCCTGGGTG TGAATGTGGT CTTCGTCGAC
CGCACCATTC CGGAACTCGA AGACGTGGAC CTGGTGAGCA TCGAAAACGT AGATTCGGCC
CGACGCGCCA CCGAACACAT GCTCGGCCTC GGACACAGGC GTATTGGCCT CATTACCGGG
CCCGTCAGCA CCTCGACGGG GCGGTCCAGG ATCGAGGGCT ACAAAGCCGC GCACAACAAC
TTTTCGATCA GCGTGGACCC CCAGCTCATC CGGGATGTAG CGTTCCGGGG AGACGGCGGG
GGTGACGCCG TCGGGTCTCT TCTGGCCCTG CCGGACCGCC CCACGGCGCT CATTGTGGCC
AACACCGCCC AGGTTCTGAG CTCCGTCCGG CGCCTTGTCC AGATCGGCGT GCGGATACCT
GACGATCTGT CGGTCATTGT CTTTGACGAC AACCCCTGGA CAGAACTGAC CAACCCGCCG
TTGAGCGTGA TCCGTCAGCC GATCGACATG CTCGCGGTGC ACTCGTTGGA GCTGGTTCTG
GGCAGGATGC AGGGACGGCT CCCTGCGGCT CCCCGCACGA TTGAAGTTAA GGCCGACTTT
GTCCCACGCA GCAGTTGCTC ACCTCTCGCC CTCTCCCCTA TCAGCTAA
 
Protein sequence
MPGQEPHRGG EVKKAPTIRD VASAAGVSVS VVSRVLNPDS GPVAPAKRET VLRVIDELGY 
RPRAAARELS VGHTPTIGLV VADLANPFFA QLADRVVWEA RSHGVQVMVM TTQEDPHLEA
DSLDTLLDRS VGGVIATPTG ANIEKWARLQ SLGVNVVFVD RTIPELEDVD LVSIENVDSA
RRATEHMLGL GHRRIGLITG PVSTSTGRSR IEGYKAAHNN FSISVDPQLI RDVAFRGDGG
GDAVGSLLAL PDRPTALIVA NTAQVLSSVR RLVQIGVRIP DDLSVIVFDD NPWTELTNPP
LSVIRQPIDM LAVHSLELVL GRMQGRLPAA PRTIEVKADF VPRSSCSPLA LSPIS