Gene Arth_2227 details

Gene Information

Locus tag: Arth_2227
Symbol:
ID: 4445288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2504268
End bp: 2506310
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 64%
IMG OID: 639690036
Product: endothelin-converting protein 1
Protein accession: YP_831707
Protein GI: 116670774
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
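
The coordinate and length fields above are related by simple arithmetic. As a minimal sketch in Python, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon (the function names are hypothetical and not part of the database), the reported lengths can be cross-checked:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp values taken from this record.
length = gene_length(2504268, 2506310)
print(length, protein_length(length))
```

For this record the sketch gives 2043 bp and 680 aa, which matches the Gene Length and Protein Length fields.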


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0489343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGAGC AACACCTCCA GCAACGTCCA CCATGCAGAC GCTGCCACCG GTGCCAGCGC 
CGGAGGCAGC CTTCACCTGA AGGAGCACTT GTGCCAAACT CGGGGATCGA CCTGTCCAAT
ATCGACCACA CCGTGCGGCC GCAGGATGAC CTGTACCAGC ACATCAACGG CGCGTGGCTG
AAGAGCACGA CCATCCCGGA CGACCGCCCC CTTGAAGGCA CTTTCACCGC TTTGCGGGAC
GGCTCCGAGC TGGCGGTCAG GGAAATCATC GAAGAAGCCG CCGGACGCGG CAAGGAAGCC
ACCGGGATCG AACAGAAGGT CGGGGATCTT TACAACAGCT TCATGGATGA AGCTGCCGTC
GAAGCCAAAG GCCTGGACCC CATCCGGGAA CGGCTCGCCT CCGTCTACGA CACGAAGTCC
GCGGCGGACG TCATCGAACT GGCAGGAAAG CTCTTCCGTT CCGACGTGTC AGGGCTTTTC
TACATCTACC CGGCACCCGA CGCGGGCAAC CCGGACCGGA TCCTGCTGTA CACCGGCCAG
GGAGGCCTCG GGCTGCCGGA CGAGTCGTAC TACCGCGAAG AAAAGTTCGC TCCCATGGTG
GCGGCCTACC GCGACCATGT CCGGACAATG CTCGGCCTCG CCGGAGCCGC GGACCTCGAT
GCCGCCGCTG AACGCGTGGT CCGGCTGGAG ACGGCCCTTG CCGCCCATCA TTGGGACAAC
GTGACCCTGC GGGATCCGCA AAAGACCTAC AACCTGAAAT CCGCCGAGGA GGCGCGGGAG
CTCTTCCCGC TGATGGATGC CTGGTTCGAG GCCGCCGGGA TCGGGCCGGA CAAACGCCAG
GAGATGGTGG TCAGCACCCC TGACTTCTTC GCCGGGGCAG CCGGCCTGAT CGAATCCGAA
CCGCTGGCGG CCTGGCAGGA CTGGCTGGCC ATGCGTGTCG TCAGCTCCGC AGCCCCGTAC
CTGTCGTCCG AATTCGTGGA CGCGAACTTC GCGTTCTACG GCACCACGAT TAGCGGCACC
CCGCGCAACA AGGACCGCTG GAAGCGCGGC GTCGCCGTCG TCGAGGCAGC ACTGGGCGAG
GCGGTCGGCC AGATCTATGT CTCCCGGCAC TTTCCCGAAA CCCACAAAGC CCGCATGCAG
ACGCTCGTCG CCAATCTCAT CGAGGCCTAC CGGCAAAGCA TCACCGCTCT GGCCTGGATG
GGCGAGGACA CGAAGCTCGA AGCCCTCAAG AAGCTCGAAG CATTCCGGGC CAAAATCGGC
TACCCGGATG AGTGGATCGA CTACTCCGCG GTGGAAATAG ACCCGGCCGA TCTGCTCGGA
AACGTTGAGC GGGCCCACAA TGCCGACGTG GACCGGCACC TGGACGAAGT TGGCAAGCCC
GTTGACAAGA ACAAATGGCT CATGACGCCG CAGACCGTCA ACGCTTACTA CCACCCGTTG
CTGAACGAGA TCGTGTTCCC GGCAGCAATC CTGCAGGCCC CGTTCTTCAC CGCGGACGCC
GACGACGCCG TCAACTACGG CGGTATCGGC GCCGTCATCG GGCACGAGAT CGGGCACGGC
TTTGATGACC AGGGCTCGCA GTTCGACGGC GGCGGTGCCC TGCGCAACTG GTGGACCGAG
GACGACCGGA CTGCGTTCGA AGCCCTGACC TCCAAGCTGG TGGCGCAGTT CGATGCCTTG
TCGCCTACTG CGGCCCCCGG CCACCATGTC AACGGCAAGC TCACCCTCGG CGAAAACATC
GGCGACCTGG GCGGGCTCAC CATCGCGTAC AAGGCTTACC TGATAAGCCT TGATGGTGCA
GAGCCGCCTG TTTTGGACGG CCTCACCGGA CCGCAGCGGT TCTTCGCGTC CTGGGCTGCC
GGCTGGCGCC AGGTGATCCG CAACGAGGAG GCAATCCGCC GTCTCGCTAC CGACCCGCAC
TCACCCAACG AGTTCCGCAC CAATGCCATC GCCAAGAACC TGGATGCGTT CCATGACGCG
TTCGGCGTGG CGGAGCAGGA CGGCATGTGG ATGCCCGCGG AGGAACGCGT CAGCATCTGG
TGA
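
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before any computation. A minimal sketch, assuming the sequence has been copied into a string (the helper names are hypothetical), that rejoins the blocks and computes a GC fraction comparable to the GC content field:

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
sample = join_blocks("ATGAAGGAGC AACACCTCCA")
print(len(sample), gc_fraction(sample))
```

Applying the same two functions to the full sequence yields a length and GC fraction that can be compared with the Gene Length and GC content fields.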
 
Protein sequence
MKEQHLQQRP PCRRCHRCQR RRQPSPEGAL VPNSGIDLSN IDHTVRPQDD LYQHINGAWL 
KSTTIPDDRP LEGTFTALRD GSELAVREII EEAAGRGKEA TGIEQKVGDL YNSFMDEAAV
EAKGLDPIRE RLASVYDTKS AADVIELAGK LFRSDVSGLF YIYPAPDAGN PDRILLYTGQ
GGLGLPDESY YREEKFAPMV AAYRDHVRTM LGLAGAADLD AAAERVVRLE TALAAHHWDN
VTLRDPQKTY NLKSAEEARE LFPLMDAWFE AAGIGPDKRQ EMVVSTPDFF AGAAGLIESE
PLAAWQDWLA MRVVSSAAPY LSSEFVDANF AFYGTTISGT PRNKDRWKRG VAVVEAALGE
AVGQIYVSRH FPETHKARMQ TLVANLIEAY RQSITALAWM GEDTKLEALK KLEAFRAKIG
YPDEWIDYSA VEIDPADLLG NVERAHNADV DRHLDEVGKP VDKNKWLMTP QTVNAYYHPL
LNEIVFPAAI LQAPFFTADA DDAVNYGGIG AVIGHEIGHG FDDQGSQFDG GGALRNWWTE
DDRTAFEALT SKLVAQFDAL SPTAAPGHHV NGKLTLGENI GDLGGLTIAY KAYLISLDGA
EPPVLDGLTG PQRFFASWAA GWRQVIRNEE AIRRLATDPH SPNEFRTNAI AKNLDAFHDA
FGVAEQDGMW MPAEERVSIW