Gene Arth_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1904 
Symbol 
ID4445558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2143256 
End bp2144506 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content58% 
IMG OID639689714 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_831386 
Protein GI116670453 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0620222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTA AGCGAAGTCC CACCATTTTC TGTGCGGTTG CGGCGGTTGT GAGCTTATGT 
CTTTGGACAA CAGGATGCAC AAATGGGTCT GGACCTGCCC CTGAGTCCTC GACTAGTTTG
TCCGGGCCGC CCACCACGCA AACGCCTGTC CCAAGCGAGA AAACCGCTGC CCCCAGCCTG
AGTCCATCCC CGAGCCCGAG TCCAAGCACG GAGGCCGTTC CCAGCAGCTG GCCGGACGTT
GTTGCCCAGA CGCAGTCGGG TGTCGGGCAG TTCAGCGTGA CGAGGTGTGA GAACAGCTTC
ACGGGCACGG GGTTCCTTGT TGGTCCTGAT CTGGTTGTCA CCGCTGCGCA CATGGTGCGG
GACGCGTCCG CGATAAGCAT TTCGTTTGGT CGAACAACGG TAAATGCCAC AACACTGGGC
ACAAACGAAC TCGCTGATTT GGCGCTCGTA AGAACCGAAA CCCCAGTTCA AGGTCATCAG
TTTCAGTTCA GAACAACTGA ACCGCCAATT GGAACGGATG TCGCTGCCCT CGGATTTCCC
TTAGGTCGGC CTTTCACTTT CACCCGGGGG ACAGTAAGCG CTCTGAACGC GGAACAGAGA
ATTGGTAGCC GAGTTCTTAG CAATCTGATT CAAACCGATA CGGCTATCAA CCATGGAAAC
AGCGGTGGAC CCCTTATTAC CCAGGATGGT CAGGTTTCTG GCGTTATCGT GACCATTGAA
TTCGACGAAA ATGTCCGGGC CGAAGGCATC GCCTACGCGG TGACTGCGCC ACGGGCTGCT
GCCGCCGTTC AGGAATGGCA GAAACGATCG GTGCCGGTAA CGCTCAAAGA CTGTGGCAAC
GCCCCGGCAC CGGGTTCAGG ATCTTTCCCG CTAACCGTCT TGTCCAGCCA CGATCAAGCA
CGCAACATCG GGCAGAGCCT CCTGCTGCAT GGCCAAGGCA TCAACCAAGG GGCCTACGCC
GCCGCCTTCA AGCAATTCAC TCCCGAACTC CAGGCAACTT TCGGTGACTC AGTTGCATGG
AGCGCTGAGC TTGGATCCTC GTACTGGCAA AAGGTCGAAA TCGTAGACGT TACAGGCAGC
GGCGATGCCC TCTCCGCTGA TGTGAACCTA CAAACACGCC AAGATGCAGC GCACGGCAGA
AACGGCCAAA CCTGTTCGAA CTGGAAGCTC CGCTATGCAA TGCATTGGGA CGGCAGCGCC
TGGCTCATAG CCGGCACGTC ATTACCCTTC GGTGAACCCA CGGCCTGCTG A
 
Protein sequence
MSIKRSPTIF CAVAAVVSLC LWTTGCTNGS GPAPESSTSL SGPPTTQTPV PSEKTAAPSL 
SPSPSPSPST EAVPSSWPDV VAQTQSGVGQ FSVTRCENSF TGTGFLVGPD LVVTAAHMVR
DASAISISFG RTTVNATTLG TNELADLALV RTETPVQGHQ FQFRTTEPPI GTDVAALGFP
LGRPFTFTRG TVSALNAEQR IGSRVLSNLI QTDTAINHGN SGGPLITQDG QVSGVIVTIE
FDENVRAEGI AYAVTAPRAA AAVQEWQKRS VPVTLKDCGN APAPGSGSFP LTVLSSHDQA
RNIGQSLLLH GQGINQGAYA AAFKQFTPEL QATFGDSVAW SAELGSSYWQ KVEIVDVTGS
GDALSADVNL QTRQDAAHGR NGQTCSNWKL RYAMHWDGSA WLIAGTSLPF GEPTAC