Gene Arth_3139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3139 
Symbol 
ID4444252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3522684 
End bp3523796 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content70% 
IMG OID639690965 
ProductLacI family transcription regulator 
Protein accessionYP_832617 
Protein GI116671684 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCGCA AGGCCACCGC ACTGGACGTC GCCAAGCTGG CGGGCGTCTC GCGCAGCGCA 
GTGTCGCTCG TGCTCAACGG GCGGGGCGAC GGCAACGTGG CCCTCGAAAG CCAGCAGCGC
ATCCGGGAAG CCGCGGCAGC GCTGAACTAC ACGCCAAACG CCATCGCCCT GAGCCTGCGC
AACCAGCGGT CGCGGGTCAT CGGGATTGTC TCGGACGAAG TGGTCACCAG CCCGTTCGAC
GGCAACATCA TCGCCGGAGC GGATGCCGTG GCCCGGTCCC AGGGCTTTGT GACTGTAGTG
ATGGATACGG AATCGGACGA GGCCCGGGAC GAGGGCGCCG TGGCCACCCT TCTGGACCGC
CAGGTGGACG GGCTGATGTA CGTCACGGTG GGACTGCGCC CCCTGCACGT CCCGCTCAAC
ATGTTGCAGG TGCCGTCGAT CCTGGCCAAC TGCTTTGATG ACCGCCCAGG GGCCGGTGTT
CCCGCCGTCA TCCCCGATGA GGTCCGCGGC GGGCGGGAAG CCGCCGAACA CGTGATGTCG
CTGGGACATC GGGACATCGC CTTCCTCGCC GGCGACTCCC TTACCCCTGC GGCGCCCCGC
CGGATCGAGG GCTACCGCGA AGCGTTTGGC GCCGCGGACA TGCCCGTCAA CGGGGACCGC
GTCCTCCAGG TGGGCTGGGA TATCGATGCC GGTTTCCACG GCGCCATTAA GCTCCTCGAC
GGCGTGGAGC CGGCCGCCCG TCCCACCGCG ATCCTGTGCG CCAACGACCG CCTGGCCATC
GGCGTCGTAC TGGCCTGCTA CCGGCTGGGG CTCAGCGTTC CGCATGACGT GTCGGTCATG
GGTTACGACG ACGAATTCCG CATCGCCAAG AACATGGTCC CGGCGCTCAG CACCATGGCC
CTCCCGCTCC GGGAGATGGG CGCGGCAGCC ATGACGGCGC TGCTCGCCGA CGTGGGGTCC
GCACCGAACG GAAAGCACGA CGGCGGCGGC CCGGCTGCCG CCGCCGGCTC CGGTACCGGC
GCTGGGACCG ACGCCGTCCA CCACGCGGTG ACGATGGTTC CGTGCCGGCT GGTGGTCCGG
GATTCCACGG GCCCCGTCCC GGCCGGCCGC TAA
 
Protein sequence
MNRKATALDV AKLAGVSRSA VSLVLNGRGD GNVALESQQR IREAAAALNY TPNAIALSLR 
NQRSRVIGIV SDEVVTSPFD GNIIAGADAV ARSQGFVTVV MDTESDEARD EGAVATLLDR
QVDGLMYVTV GLRPLHVPLN MLQVPSILAN CFDDRPGAGV PAVIPDEVRG GREAAEHVMS
LGHRDIAFLA GDSLTPAAPR RIEGYREAFG AADMPVNGDR VLQVGWDIDA GFHGAIKLLD
GVEPAARPTA ILCANDRLAI GVVLACYRLG LSVPHDVSVM GYDDEFRIAK NMVPALSTMA
LPLREMGAAA MTALLADVGS APNGKHDGGG PAAAAGSGTG AGTDAVHHAV TMVPCRLVVR
DSTGPVPAGR