Gene Arth_3308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3308 
Symbol 
ID4444002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3713991 
End bp3715031 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content68% 
IMG OID639691132 
ProductLacI family transcription regulator 
Protein accessionYP_832784 
Protein GI116671851 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACGGT CAGCCAGCAT CAAGGATGTT GCGAACCATG CCCGCGTAGC GGTGGGAACG 
GTGTCCAACG TCCTGAATTA CCCGGACCGG GTTTCACAGC GGACCAAGGA CCGCGTTCTG
CAGGCGATCG ACGAGCTTGG CTTCGTCCGC AACGACGCAG CCCGGCAGCT CCGGGCCGGA
CACAGCCGCA CCATCGGCCT GATTGTGCTG GATGTGGGCA ACCCCTTCTT CACCTCCGTG
GTCCGGGCCG CCGAGGACGC CGCCGCCCTG CAGGGAAGCG CCGTCCTGCT CGGGGACAGC
GGGCACGATG CCGGCCGGGA GTCGAACTAC ATCGACCTCT TCCAGGAGCA GAGGGTCCAG
GGCCTGCTGA TCTCGCCCGT GGGTGACGTC ACTGAGCGCC TCGACCAGCT GCGTGAGCGC
GGCGTCCCCA CCGTTCTGGT GGACCGGCTG GCCGATGAGA CGAAGTACAG CTCAGTTTCC
GTTGACGACG ACGCCGGCGG TTACCTCGCC GCACGGCACC TGCTGGACAT CGGCCGCCGT
CGGCTGGCTT TCGTGGGAGG CCCGACGTCG ATACGCCAGG TGGCGGACCG CCTCCAGGGG
GCGCAACGCG CCGTCGCTGA AGTTCCGGAC GCTTCACTTG AAATTCTGGA TTCGGCCGGA
CAGACCGTCC TGGCGGGCCG GGGCGTGGGC GACCAGCTGG TGCGCCGCAG CTCCGGCGAA
CTGCCGGACG GCGTGTTCTG CGCCAACGAC CTGCTCGCCC TCGGCGTAAT GCAGTCCCTC
ACCATGCTGC ACACTCTGCG GATCCCGGAA GACATCGCCC TGATCGGCTA TGACGACATC
GACTTCGCCG TGTCAGCCGT GGTGCCGCTG TCCTCGATCC GCCAGCCAAC GGAAGCGCTC
GGCCGGACCG CCATCGAGCT GCTGGCCGAG GAAGTGGACG CCATGGGGCC CGCCTCGGTG
CGGCCCCACC ACCGCGCCGT GATCTTCACT CCCGAACTGG TGGTGCGGCA AAGCACCGCG
GGCGCCGCCA CCCCGGCCTA G
 
Protein sequence
MSRSASIKDV ANHARVAVGT VSNVLNYPDR VSQRTKDRVL QAIDELGFVR NDAARQLRAG 
HSRTIGLIVL DVGNPFFTSV VRAAEDAAAL QGSAVLLGDS GHDAGRESNY IDLFQEQRVQ
GLLISPVGDV TERLDQLRER GVPTVLVDRL ADETKYSSVS VDDDAGGYLA ARHLLDIGRR
RLAFVGGPTS IRQVADRLQG AQRAVAEVPD ASLEILDSAG QTVLAGRGVG DQLVRRSSGE
LPDGVFCAND LLALGVMQSL TMLHTLRIPE DIALIGYDDI DFAVSAVVPL SSIRQPTEAL
GRTAIELLAE EVDAMGPASV RPHHRAVIFT PELVVRQSTA GAATPA