Gene Arth_3574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3574 
Symbol 
ID4443885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4015517 
End bp4016797 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content67% 
IMG OID639691398 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_833049 
Protein GI116672116 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCGTT CAACACCCCC ACGCCCGACG CCGGGACGCC GGCAGTTCCT GCAGCTCGCC 
GCTGCCGGCG GTGCCGCCGT CGTTCTTTCC GCCGCACCAG GAACGGCGTG GGCCGCACCG
ACCGCGGGAC GGACCCGGTG CTACGTGCTC GTGGTGGACG GCTGCCGCCC GGACGAGATC
ACCCCCGCGC TGACGCCGCG GCTGGCTGAC CTGCGTGCAG CCGGCACCAA CTTCCCGGCG
GCACGGTCCC TCCCGGTCAT GGAGACCATT CCCAACCACG TGATGATGAT GAGCGGCGTG
CGACCGGACC GCTCTGGCGT TCCGGCCAAC GCGATCTACG ATAGGGCCGA GGGTGTGGTC
CGCGACCTCG ACCGGTCCAC GGACCTGCAC TTCCCGACCA TTCTGGACCG CTTGCAGGAG
CGCGGGCTCA CCACCGGCTC GGTGCTGAGC AAGAAGTACC TCTATGGCAT CTTCGGAGCC
AGGGCAAGCT ACCGGTGGGA ACCGCAGCCG GTGCTTCCGG TAACCGGCCA TGCTCCCGAT
GCCGCCACGA TGGACGCCCT GCTGGCGATG GCAGGCGGGC CGGATCCGGA CTTCGTGTTC
ACAAACTTGG GCGACATTGA CCGCGTGGGC CACTCCGACC TTTCCGGCAC CACGCTGCGA
GCCGCCCGGG AATCCGCACT GGCGGACACG GACCTGCAGG TGGGCCGCTT CATCGACCAT
CTCAAAGGCA CGGGCAAGTG GGAGTCCAGT GTGGTGATGG TGCTCGCCGA CCACTCCATG
GACTGGTCCA TCCCCACGAA CGTGGTTTCC GTCGACCTGG TCCTGCAGTC CCGTCCGGAG
TTGCAGCACA ACGTCAGGAT CGCCCAGAAC GGCGGGGCTG ACCTGCTCTA CTGGACCGGT
CCTGATGCAG AGCGTGCGGC CGGTATGGCT GCCGTCGAAC AGTTAGTCAG CGCCCATGAG
GGAGTGCTGT CCGTCCATAA ACCGGTGGAC CTGCGGCTGG GGACCGAGGC CGGAGACCTC
GTAGCCTACT GCCGCGCCGG CTGGCGTTTC TCCGACCCGT ATGTGGCTTC CAACCCGATC
CCGGGAAACC ACGGACACCC CGCCACCGAA CCCATCCCCT TCTTCATCTC CGGCGGCAGC
CCGCTGGTGG CACCCGGGAC GGTGTCCTCG GAGCATGCAA GGACTATCGA TGTTGCACCG
ACCATCGGCA CCATTTACGG GCTCAAAGCC CCGGACGGCG GGTATGACGG AACTTCGCGG
TCCGGCTCCC TGCGGCTCTG A
 
Protein sequence
MSRSTPPRPT PGRRQFLQLA AAGGAAVVLS AAPGTAWAAP TAGRTRCYVL VVDGCRPDEI 
TPALTPRLAD LRAAGTNFPA ARSLPVMETI PNHVMMMSGV RPDRSGVPAN AIYDRAEGVV
RDLDRSTDLH FPTILDRLQE RGLTTGSVLS KKYLYGIFGA RASYRWEPQP VLPVTGHAPD
AATMDALLAM AGGPDPDFVF TNLGDIDRVG HSDLSGTTLR AARESALADT DLQVGRFIDH
LKGTGKWESS VVMVLADHSM DWSIPTNVVS VDLVLQSRPE LQHNVRIAQN GGADLLYWTG
PDAERAAGMA AVEQLVSAHE GVLSVHKPVD LRLGTEAGDL VAYCRAGWRF SDPYVASNPI
PGNHGHPATE PIPFFISGGS PLVAPGTVSS EHARTIDVAP TIGTIYGLKA PDGGYDGTSR
SGSLRL