Gene Arth_0442 details


Gene Information

Locus tag: Arth_0442
Symbol:
ID: 4447067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 470838
End bp: 472607
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 69%
IMG OID: 639688241
Product: alpha/beta hydrolase fold
Protein accession: YP_829943
Protein GI: 116669010
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
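
The length fields in this record are consistent with its coordinates. The following is a minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon; the helper names gene_length and protein_length are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(470838, 472607)
print(length, protein_length(length))  # 1770 589
```

Both results match the Gene Length and Protein Length fields listed in the record.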


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.895468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGGCT ATGCCGAGCT ACGGGCCTTC CGGGCCACGC ACACGTACCG CTCGCTGACT 
GTCGGCGGCG TCGAATGGAA CTACGTCCCG GGGCAGGCAC CCCCGGGCGC GAGGGGCGCC
GGGCCGGCAC TCCTTGTCCT GGGCGGCGGG TTCTCGTTCG GCGAGTCCGC GTTCCGCACG
ATCACCGCCT TCGAACCCCG CTTCCGGGTC CTCGCCCCCT CGTACCCCGC GGTGCGGACA
ATGGCAGAAC TCCTGACCGG GCTCGCAGCC ATTCTCGACG CCGAGGGTAT CCCGTCCGCG
AACGTTTTCG GTCACTCCCT TGGTGCCGGC GTCGCCCACG CCTTTGCCCG CCGTTATCCG
CAGCGGGTGG ACAGGCTCGT CCTTTCCGGG TTCGGCCTCT ACACCCGCGG TCACACACGT
CTGGTGAGCG CCTTTGTCCG TCTGTTCTCA GTGTTGCCCA AGGCCGCCCT CGCCGCTTTT
TACCGTCCCA GGATTGCTCG GCTTCTCGAA GGGGCCGAAG AGGACGAGCG TGCTTTCCTC
AGCGCCTACA CGGAAGACCT CTTCGCAGCA CACACCAAGG AATCGGCCCT GGCACGGCTG
GCAGTGCTCC TTGACCTGGC GGCGCATCCG GACCTTTATT CCGCAGCATC GGCCTTCGAG
CGACCGGCGG ACGTGCTGCT GATTGCTGCG TCTGACGACC GCGGGTTCAC ACCGCGCGAA
CGCGAAGCCT TGCTGGCCAC CTATCCCGGC GCCAGGGCCC ACGTTTTCGG GCGAGGCGGC
CACTGGGCGG CGGTCACCCA CCCGACAGAG TACGATGCCG TCGTCGGCCG CTTCCTCGAA
GGCCGTCCGC CGCCCCCGGG GAGGAGCGAA CGCGCTCCCG CGCAGCCGCC CCGCCGCGCG
CCCCGCCGCG GCGGGGAAGA ACGGCTTAAC GCTTCGGCCC TGGACGCCAA ACTAGCCGCT
TTCCGTGCCG GTCACCGGTA CCGTACCGTC GACGTCGGCG GCGTGCGCTG GCGCTACCTC
GCCGGCGGTT CGGGCGAGCA GGTGTTGCTC CTCCCCTCCG GCGGGACCCG GGTGCCGGAC
ATGTATCTGC TGCTGATCGA AGCGCTGGAA CGGGACTTCC GTGTCCTTGC GCCGGCTTAT
CCCGCCGGCG CCGGAATCGC CGGGCTTGCG GACGGACTGG CCGCGATCCT CGACGCCGAG
GGGGTCAAAG AGGCGGATGT GCTGGGTTCG TCGTTCGGCG GGTTCGTGGC GCAGGTCTTT
GCCCGCCGGC ATCCCGAACG CGTGCGCCGA CTCGTGCTGG CGAACACCGG GGGCCCGGCC
GCGGCCCCGC TCCCGGGGCT GCCGCTCCTT ATCCGCTGCC TCGCCGTCCT TCCGGAAGAC
GCTGTGCGTT TCTTGACCGG CTGGAATTGG CGCCGCTGGT TTGTAACGGG CGCGAACGGC
GACTCGAAGT TCTGGGACGC ACTGCTGGGG GACATTCTCA GCCGGCTGGG CAAGGCGGAT
CTGCTGTCCG CATTGCGCGA AATGAACGAC TTCGCGCATC TGCCGGCCGA AGCGGTGCAG
GGGGTGCCCG CCGCCGCCAC ACCGCCGGCA CCGGTGCTGC TGATCGAATC CGAGCGGGAC
GAAGCATTCT CACCCCGGGC GCGGGCGGCT CTCCGCGCGC TCTATCCAGC GGCTGAGGTC
CGGCTTTTCG CGGGTGCGGG CCACGGGGTA ATGGCAACGC GGACGGCGGA GTATGTCGAG
ACAGTACGGG AATTCCTCCG CATGCCCTGA
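
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that in Python, assuming the sequence is pasted as text in the blocks-of-ten layout used above; gc_fraction is a hypothetical helper name, and the database may round or compute the value differently.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, used only as a short example.
print(round(100 * gc_fraction("ATGGACGGCT ATGCCGAGCT")))  # 60
```

Passing the full gene sequence gives the fraction for the whole CDS, to be compared against the reported 69%.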
 
Protein sequence
MDGYAELRAF RATHTYRSLT VGGVEWNYVP GQAPPGARGA GPALLVLGGG FSFGESAFRT 
ITAFEPRFRV LAPSYPAVRT MAELLTGLAA ILDAEGIPSA NVFGHSLGAG VAHAFARRYP
QRVDRLVLSG FGLYTRGHTR LVSAFVRLFS VLPKAALAAF YRPRIARLLE GAEEDERAFL
SAYTEDLFAA HTKESALARL AVLLDLAAHP DLYSAASAFE RPADVLLIAA SDDRGFTPRE
REALLATYPG ARAHVFGRGG HWAAVTHPTE YDAVVGRFLE GRPPPPGRSE RAPAQPPRRA
PRRGGEERLN ASALDAKLAA FRAGHRYRTV DVGGVRWRYL AGGSGEQVLL LPSGGTRVPD
MYLLLIEALE RDFRVLAPAY PAGAGIAGLA DGLAAILDAE GVKEADVLGS SFGGFVAQVF
ARRHPERVRR LVLANTGGPA AAPLPGLPLL IRCLAVLPED AVRFLTGWNW RRWFVTGANG
DSKFWDALLG DILSRLGKAD LLSALREMND FAHLPAEAVQ GVPAAATPPA PVLLIESERD
EAFSPRARAA LRALYPAAEV RLFAGAGHGV MATRTAEYVE TVREFLRMP
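
The protein sequence corresponds to the gene sequence translated with translation table 11. Table 11 uses the same codon-to-residue assignments as the standard genetic code and differs in which codons may act as starts, so the minimal Python sketch below builds the standard codon table. It assumes an ATG start, as in this gene, and does not handle alternative start codons; translate and the constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGACGGCT ATGCCGAGCT ACGGGCCTTC"))  # MDGYAELRAF
```

The output matches the first block of the protein sequence; the final codons CGC ATG CCC TGA likewise give RMP followed by a stop.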