Gene Arth_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0801 
Symbol 
ID4446723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp865079 
End bp866602 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content71% 
IMG OID639688607 
Productmolybdopterin binding domain-containing protein 
Protein accessionYP_830299 
Protein GI116669366 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCTG CGCCGCCGGA CGGCGGTGAC GCCGCCAGCG ACGTTCCTGA CCACGGGGAA 
TCCTCCGGCC GCACCGCACC CGGCGCGGCT GAAGCCGGTG CCGGCACCGG TGCGCCCGAG
GCCGACACAG GATCCCACGA TGCCGGGCAC ACCCATTTGG CGCACACCTG GCAGGAAGCC
CGGCAGAGGT CATTCGATTG CGCCACACCC ATTCCGCCGG GACCGGTGCC GCTCAGGAGT
GCGCTGGGCC GGACGCTCGC TGCCGATATT ACGGCGCTCC AGGATATGCC CCACTACGCT
TCATCCGCCA TGGACGGCTG GGCAGTGAAC GGGACGGGGC CCTGGATTTT GGCCGAACCC
GGCCAGAGGC TTGCGCCGCA CCAGGCCAGT CCCATCGTCA CCGGCGGCCT CATACCACCA
GGGGCCAAGG CTGTGCTCCG CAGCGAAAGC GGCATGATCA CGACGGACGA CGAGGGGCTC
CCCATCCTTG CCCTTGGTGG CGCAGCAAGG CCCGGTGAAC CGAAGAACGG CCAGCACATC
CGGAAGGCAG GCGAGGAAGC GGCCGCCGGT GACGTCCTGG TCAAAAGCGG GGCAGTCCTC
AACCCGGCCC ACCTGGCCCT CGCGGCACTG GCCGGCCACG ACTCCCTCCG GGTGCAGGGG
AAACCTGTGG TCAGGATCCT GCTGACAGGC TCCGAGGTGG TTACCGCGGG CCTGCCTGCT
CCGGGCAAGG TGCGCGACAC CTTCGGCCCG CAGCTTGGGG CTGTGGTCGA GATGCTCGGT
GGGATCTGCG CCGGGCAAAA GAAAATAGGC GACGGCTACG ACGAATGGCT GGCTGCCCTG
GAAGACGACG GACCGGAATC GGCCGGTCCG GCGGAGAAGA CCGTGCCGGA TGACGGCTCC
GTGCTGTTGC CGGAAGAGCC GGTAGCGGAA GAAGCGCCTG CCGACGTCGT CATCACCACC
GGCGGAACCG GCCGGTCCGG GACTGATCAC CTCCGCCGTG CAGTCGCGGA ACTGGGCGGC
CGCCTGCTGA TCGACGGCAT CGCCATGCGC CCGGGACACC CGGCCGTCCT CGCCGAGCTG
CCGGACGGCC GCTTCATCCT TGGCCTGCCG GGCAATCCCC TTGCCGCGAT GATGGCCCTC
TGCACGGTGG GCGCGCCCCT GCTCGCCGCC CTTGGCCACG GAACCCTCCC TCCGGTCCAT
GAGGTGCCCT GCGGCGCGAT GATCGAGGCT GATCCGGGGC GGACCCGGCT GATGCCCTTC
AGGCTGCTGT ACGGGATGGC GTCCCCTGCC CGGCACGCGG GGCCCGGCAT GATGCGCGGT
CTTGCTGCTG CCGACGGCGT CCTTGTTGTT CCGCCGCACG GCGTCCAGCT GGGCGAAGCG
GTGCCCGCCT TCGCCTTGCC CTGGGGCGCT CCGATCCAGG CTGCGGAACC CGCCGCCGCG
AAGGCCAAAG CCGCTCCCCG CAAGGCCCCG CGAAAGCCCT CAGCCTCGGA CGGGCCGGTG
GACTGGAGTG CGCTGCTCGG CTAA
 
Protein sequence
MTAAPPDGGD AASDVPDHGE SSGRTAPGAA EAGAGTGAPE ADTGSHDAGH THLAHTWQEA 
RQRSFDCATP IPPGPVPLRS ALGRTLAADI TALQDMPHYA SSAMDGWAVN GTGPWILAEP
GQRLAPHQAS PIVTGGLIPP GAKAVLRSES GMITTDDEGL PILALGGAAR PGEPKNGQHI
RKAGEEAAAG DVLVKSGAVL NPAHLALAAL AGHDSLRVQG KPVVRILLTG SEVVTAGLPA
PGKVRDTFGP QLGAVVEMLG GICAGQKKIG DGYDEWLAAL EDDGPESAGP AEKTVPDDGS
VLLPEEPVAE EAPADVVITT GGTGRSGTDH LRRAVAELGG RLLIDGIAMR PGHPAVLAEL
PDGRFILGLP GNPLAAMMAL CTVGAPLLAA LGHGTLPPVH EVPCGAMIEA DPGRTRLMPF
RLLYGMASPA RHAGPGMMRG LAAADGVLVV PPHGVQLGEA VPAFALPWGA PIQAAEPAAA
KAKAAPRKAP RKPSASDGPV DWSALLG