Gene Arth_3479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3479 
Symbol 
ID4443789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3914358 
End bp3916451 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content68% 
IMG OID639691303 
ProductKP-43 peptidase 
Protein accessionYP_832954 
Protein GI116672021 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAA TAACCATCAA CGGCATCACC ATTGATCCGA TCAAACAGAA CCGGGCCCTG 
CGTGACGCCG GGCTGGTTGC CGAGGACGCC TCGGAGTCGG ACCACATCCT CATCCAGACC
GCCGAGCCGC TGACAGCGGA ACAGCGTGCG GAACTTGCCG GCATCGACGT CGAGATGCAG
GAATACGTCT CGGACAACAC CTACCTGGCG GCGTTCCCGC CGGCGGACCT GAACCGGGTG
CGCGCCCTGC CGTTCGTGAG CTGGGCGGAC GTTTACTCCC GCGTTTTCAA GATCCCGCCG
CCCCTGCTGC CGCGCAGCGC GGACACGGGC AATGTCCGGT CCCTGGCGGA CCATGAACCG
CATCCGGACC GGCGGCTGGA AAGAGTGGAC CTGCTCCTGC ACCCGGGCAT CGAGGCCGGA
CCGGAGCTGA TCGCCCGGGT GGCCGCCGCC GCCAGGGTGG AGCCCGACGC CGTCGCGGTG
ACCCCGGGCA AGCTGCGCAT CACCACGTCG GTCGGCCAGC TCCCGGAACT TGCCGCCATC
GATGAGATCC GCGAGATCCA TCCGGTCCGC GAGCGCCAGC TCTTCAACAA CGTGGCCCGG
GAGATCCTGA ACGCCGACGT TCAGCTCAAC GGGACAACAT ACCGCGGCGC CGGCGAGGTG
GTGGCCGTAG CGGACACCGG CTTTGACACC GGAGACGCCG CCAACCCGCA CCCGGCATTC
ACCGGACGGG TCCAGACACT CTACGCATTG GGCCGCACGG CACCGGACAA GGCAGACGAC
CCGCATGGCC ACGGGACGCA CGTGGCCGGC TCGGTGCTGG GCCGGGGAAA CTCGGCCACC
ATGGGCGGAG CGATTGAGGG CACGGCGCCG GAGGCCCTGC TGATCCTGCA GAGCCTCCTC
GACCCCAACG GCGGCCTGGG CGGCATCCCG GTCAATCTCA ACGACCTCTT CCAAAAGACG
TACGACGACG GCGCACGCGT ACACACGAAT TCCTGGGGCG TGCCCGGACT CAACCTTCCG
TACGATGCGA GTTCGCGGGA GATCGACGAA TTCGTGTGGA ACCACCCGGA CCAGGTGATC
TGCTTTGCGG CGGGGAATGA CGGCGTGGAC GGCAACAGCG ACGGCACGGT GGACTCGAAC
TCCATCGGTT CCCAGTCCGC TGCGAAGAAC TGCATCACCG TGGGCGCCAG CGAAAGCCTC
CGCAAGGAGT TCACGCCGTC CTACGGCACC TACTGGCCCG GAGATTTCCC CGCGAATCCC
GTCAAGCGGG ACAAGCAGGC CAACAACCCG GACGGGATGG TGGCCTTCTC CAGCCGCGGG
CCCACCAAGG AGGGCCGCAT CAAGCCCGAC GTCGTGGCGC CGGGAACCAG CATCCTGTCC
ACGCTCTCGC GGAACGCTCC GATGGGCAAC ACCTTCGGCA CCTCCACCGA TCCGCTGTTC
TTCTTCGACT CCGGAACTTC CATGGCCACC CCGCTGGTGG CCGGCTGCGC CGCGGTTCTG
CGCGAGACCT TGGTGAAGAA CGGCCTCAAC TCGCCAAGTG CGGCCCTGGT CAAAGCCCTC
CTGGTCAATG GCGCCGACGT CCTGCCCGGA CAGTACAACC CCAGTGAGGC CGGGGAATCG
CCGAACGGGA ACTCCGGGTG GGGCCGGGTC AACCTGGCCC GGTCCGTGGT CCTGCCCGGG
CAGCCCGGCA ACGCCGGCCT GGGCGAAGGG GGACCGCTGG AGCAGGGGCA GGAGGACTCC
TTCACCATCG ACATCCCTGA GGAAGTCCCG AAGGTTGCTG CCAAAGGGAG GCGGAACCGG
GGCCCGGCCG CGGAACCGGC GCTGACCGCA GCCGGGGTGA CGCTGAAGAT CACCCTCGTG
TGGTCCGATC CGCCCGGCCC GCAGCTGCAG AACGACCTCG ACCTCATTGT GCTGGCAGCC
GACGGCAGCG AGCGCCACGG AAATTCAGGA ACGACCGCCG GCTTCGACCG CCGCAACAAC
GTGGAACAGG TGCTCTGGAC GGGCATGCCG CCCGGCCAGG CCAGGATCGT GGTCAGGGCT
TTCCGGATCA CGCAGTTCCC GCAGCCCTAC GCCTACGTTT GGCGGCTGTC CTAG
 
Protein sequence
MSEITINGIT IDPIKQNRAL RDAGLVAEDA SESDHILIQT AEPLTAEQRA ELAGIDVEMQ 
EYVSDNTYLA AFPPADLNRV RALPFVSWAD VYSRVFKIPP PLLPRSADTG NVRSLADHEP
HPDRRLERVD LLLHPGIEAG PELIARVAAA ARVEPDAVAV TPGKLRITTS VGQLPELAAI
DEIREIHPVR ERQLFNNVAR EILNADVQLN GTTYRGAGEV VAVADTGFDT GDAANPHPAF
TGRVQTLYAL GRTAPDKADD PHGHGTHVAG SVLGRGNSAT MGGAIEGTAP EALLILQSLL
DPNGGLGGIP VNLNDLFQKT YDDGARVHTN SWGVPGLNLP YDASSREIDE FVWNHPDQVI
CFAAGNDGVD GNSDGTVDSN SIGSQSAAKN CITVGASESL RKEFTPSYGT YWPGDFPANP
VKRDKQANNP DGMVAFSSRG PTKEGRIKPD VVAPGTSILS TLSRNAPMGN TFGTSTDPLF
FFDSGTSMAT PLVAGCAAVL RETLVKNGLN SPSAALVKAL LVNGADVLPG QYNPSEAGES
PNGNSGWGRV NLARSVVLPG QPGNAGLGEG GPLEQGQEDS FTIDIPEEVP KVAAKGRRNR
GPAAEPALTA AGVTLKITLV WSDPPGPQLQ NDLDLIVLAA DGSERHGNSG TTAGFDRRNN
VEQVLWTGMP PGQARIVVRA FRITQFPQPY AYVWRLS