Gene Arth_3896 details


Gene Information

Locus tag: Arth_3896
Symbol:
ID: 4445097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4387766
End bp: 4388785
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 67%
IMG OID: 639691721
Product: LacI family transcription regulator
Protein accession: YP_833371
Protein GI: 116672438
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
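
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention under which the reported 1020 bp is reproduced) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Arth_3896.
length = gene_length_bp(4387766, 4388785)
print(length, protein_length_aa(length))  # prints: 1020 339
```

Both results agree with the Gene Length and Protein Length fields of the record.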


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGTCA CCATCAGCGA CGTCGCCCAC GCGGCCGGCG TCAGCAAGGG TGCTGTCTCC 
TATGCCCTCA ACGGCCAGCC GGGAGTCAGC GAGGGAACCC GGGAACGGAT CCTCCAGGTG
GCCAAAGAAC TCGGCTGGAA ACCCAGCCTG CGGGCCAAAG GCCTCTCGTC CGCCAAGGCC
TACGCCTTGG GACTCGTCGT TGCCAGGGAC CCCTCCCTGC TCGGAACGGA CCCGTTCTTT
CCGGCGTTCA TCGCCGGCAT CGAAACGGCG CTCGCGGAAC ACGACTATAC CCTGGTGCTC
AGCGTCGCTA CGGGCGCCGG AGCGGAGGAA CGCTGCTACC GCAAGCTGGC CGAGAACGGA
AGGGTGGACG GGTTTCTCCT CACCGACGTC CGACACGACG ACTCCCGCAT CCCGCTGCTG
CAGGAACTCA AACTCCCGGC GGTCACCTTG AACCGGCCCG ACGCGGAATC GCCGTTCCCT
GCCGTTTCCA TGGACGACTG CGCCGGAATC ACGGCAGCCG TCGAGCATCT GGTGGCGCTG
GGCCACACCC GCATTGCGCA TGTCGGCGGC GGCCAGGAAT ACATCCACGG CAGGAGCCGC
CGCCAGGCGT GGGAAGACGC TCTGTCCGCT GCCGGCCTCC GCGCCGACCT GTTCGAGGAG
GCCGACTTCA CAGCCGCCGG AGGCATGGCC GCCACCGCCG ATCTGCTCCG ACGGGCGGAC
AAGCCCACGG CAATCGTGTA CGCCAATGAC CTGATGGCGA CCGCCGGCCA GTCTTACGCA
CAAACCCAGG GCCTCACTGT GCCGGGAGAT TTGTCCGTCA CGGGCTATGA CAACGCCGAC
TTCACCCAGT ACCTCAATCC GCCCCTCACC ACCGTCTCCA CCGATCCCAT GCTCTGGGGC
CAGGTTGCCG CCCAAGTGCT CCTCAATCAG CTCAGCAACG CGCACGACGG GCAAGACACG
GTTCTCCAGG CTCCGACTCT TTTGGTGAGG GCCTCCACCG GCCCGGTACC CGCCCGTTAG
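
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence can be measured or analysed. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of the record. The helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence: six blocks of 10 bases.
first_line = "ATGGCAGTCA CCATCAGCGA CGTCGCCCAC GCGGCCGGCG TCAGCAAGGG TGCTGTCTCC"
print(len(clean_sequence(first_line)))  # prints: 60
```

Passing the full 17-line sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content reported in the record.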
 
Protein sequence
MAVTISDVAH AAGVSKGAVS YALNGQPGVS EGTRERILQV AKELGWKPSL RAKGLSSAKA 
YALGLVVARD PSLLGTDPFF PAFIAGIETA LAEHDYTLVL SVATGAGAEE RCYRKLAENG
RVDGFLLTDV RHDDSRIPLL QELKLPAVTL NRPDAESPFP AVSMDDCAGI TAAVEHLVAL
GHTRIAHVGG GQEYIHGRSR RQAWEDALSA AGLRADLFEE ADFTAAGGMA ATADLLRRAD
KPTAIVYAND LMATAGQSYA QTQGLTVPGD LSVTGYDNAD FTQYLNPPLT TVSTDPMLWG
QVAAQVLLNQ LSNAHDGQDT VLQAPTLLVR ASTGPVPAR
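
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style string for the standard code. Translation table 11, named in the record, assigns the same amino acids to the 64 codons as the standard code and differs only in which codons may act as start codons. This gene begins with ATG, so no special handling of the first codon is included here. The function and constant names are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; whitespace in the input is ignored."""
    seq = "".join(seq.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGGCAGTCA CCATCAGCGA CGTCGCCCAC"))  # prints: MAVTISDVAH
```

The output matches the first block of the protein sequence. Applying `translate` to the full gene sequence should stop at the terminal TAG codon.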