Gene Arth_2892 details


Gene Information

Locus tag: Arth_2892
Symbol:
ID: 4444449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3258453
End bp: 3259523
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 68%
IMG OID: 639690715
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_832371
Protein GI: 116671438
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
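
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function and constant names are illustrative only and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 3258453
END_BP = 3259523
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1071
    print(protein_length(GENE_LENGTH_BP))  # 356
```

Both results match the reported Gene Length (1071 bp) and Protein Length (356 aa).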


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.266312
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGAA CACAGCCCCT AGTCCTGGGC ATCGAGTCCT CCTGCGATGA GACAGGTGTG 
GGAATCGTGC GCGGAACTGC GCTGCTCAGC AACACTGTGT CATCCTCCAT GGAAGAGCAT
GTCCGCTTCG GCGGAGTCAT CCCCGAGATC GCCTCCCGTG CACACCTGGA CGCCTTCGTG
CCCACCCTCC AGGAAGCCCT CGCGGACGCG GGAGTCCAGC TTGACGACGT GGACGCGATC
GCCGTCACTT CCGGTCCCGG GCTGGCCGGG GCTTTGATGG TGGGCGTGTG CGCCGCCAAG
GCGCTCGCGG TGGCCACGGG CAAACCGCTA TATGCCATCA ACCACCTGGT GGCCCACGTC
GGTGTCGGCC TGCTGCAGGA GGAGAACACC CTGCCTGAAC ACCTGGGCGC CCTGCTGGTT
TCCGGCGGCC ACACCGAGAT CCTCCGGATC AGGAGCATCA CCGACGACGT CGAGCTGCTG
GGCTCCACGA TTGACGACGC TGCCGGGGAA GCCTACGACA AAGTGGCACG GCTCCTGGGG
CTCGGCTACC CGGGCGGCCC GGCCATCGAC AAACTAGCCC GGACAGGCAA CGCCAAGGCC
ATCCGGTTCC CGCGCGGACT GACGCAGCCC AAGTACATGG GCACCGCGGA CGAACCCGGC
CCGCACCGCT ACGACTGGTC CTTCAGCGGA TTGAAGACCG CCGTCGCCCG TTGCGTGGAG
CAGTTCGAAG CCCGGGGCGA CGAAGTGCCG GTCGCGGACA TCGCGGCCGC CTTCCAGGAG
GCCGTTGTGG ACGTCATCAC GTCCAAGGCG GTGCTCGCCT GCACGGAAAA CGGCATCACC
GAGCTCCTGC TGGGCGGCGG GGTAGCCGCG AACTCGCGGC TGCGCCAGCT CACCGAACAG
CGGTGCAGGG CGGCCGGAAT CCGGCTGACT GTTCCGCCGC TTGAGCTGTG CACAGACAAC
GGTGCCATGG TGGCCGCCCT CGGTGCCCAG CTGGTCATGG CCGGCATCGA GCCCAGCGGC
ATCAGCTTCG CCCCGGATTC GTCCATGCCG GTCACGACGG TTTCGGCGTA G
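
The gene sequence is printed in space-separated blocks of 10 bases, so it needs light cleaning before any analysis. The sketch below strips the whitespace and computes a GC percentage. The helper names are hypothetical, and the rounding convention behind the 68% figure reported for this gene is not stated on the page, so a recomputed value may differ slightly in its last digit.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


if __name__ == "__main__":
    # Toy input in the same block layout; paste the full gene sequence
    # here to recompute the value for Arth_2892.
    toy = "ATGC GCGC"
    print(gc_percent(clean_sequence(toy)))  # 75.0
```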
 
Protein sequence
MNRTQPLVLG IESSCDETGV GIVRGTALLS NTVSSSMEEH VRFGGVIPEI ASRAHLDAFV 
PTLQEALADA GVQLDDVDAI AVTSGPGLAG ALMVGVCAAK ALAVATGKPL YAINHLVAHV
GVGLLQEENT LPEHLGALLV SGGHTEILRI RSITDDVELL GSTIDDAAGE AYDKVARLLG
LGYPGGPAID KLARTGNAKA IRFPRGLTQP KYMGTADEPG PHRYDWSFSG LKTAVARCVE
QFEARGDEVP VADIAAAFQE AVVDVITSKA VLACTENGIT ELLLGGGVAA NSRLRQLTEQ
RCRAAGIRLT VPPLELCTDN GAMVAALGAQ LVMAGIEPSG ISFAPDSSMP VTTVSA
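
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the database's own method. It relies on the fact that translation table 11 (bacterial, archaeal and plant plastid) assigns codons to amino acids exactly as the standard code does and differs only in which codons may act as start codons. Because this gene begins with ATG, no special handling of alternative starts is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing) is ignored. Codons with
    unexpected characters are rendered as 'X'. Alternative start codons
    (e.g. GTG, TTG) are NOT converted to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence.
    print(translate("ATGAACCGAA CACAGCCCCT AGTCCTGGGC"))  # MNRTQPLVLG
```

The first ten residues produced this way, MNRTQPLVLG, match the start of the protein sequence listed above.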