Gene Arth_4482 details


Gene Information

Locus tag: Arth_4482
Symbol:
ID: 4443428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008537
Strand:
Start bp: 106043
End bp: 107821
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 60%
IMG OID: 639687535
Product: UBA/THIF-type NAD/FAD binding protein
Protein accession: YP_829232
Protein GI: 116662177
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
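
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: coordinates are 1-based and inclusive, and the gene length includes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 106043
END_BP = 107821
GENE_LENGTH_BP = 1779
PROTEIN_LENGTH_AA = 592


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span is 1779 bp, which is 593 codons, or 592 residues once the stop codon is excluded.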


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTTCC TGCCCTGGTG GGAACGCTAT GCCGGGCTCC TGCAGTCAGA AATTTCTTGG 
CTGCAGGACC TCGGCATTGC CTGCCGAATC GACGAAACCA AGCGTGACGA TCACCAGACC
TTGACCATGG AGTTGTCCGT ACCGGAAACA GTGACCGGAA CAGCCCCTCT GGAGCTGACG
GCTGTCTTCC CGGATTTCTA TCCGCTCGTT CCGCCGAAAG TCTTCGCCGT GGACCTAGGG
ATGCCTCATC ACTGGAACCC GTTCAGCAAC GAAGTGTGCC TGTTGGGAAC CCCATCGGAA
GAATGGGGAA CGAATGGGTC ACTGGCTCAG CTGCTGAAGG ACCAGTTACC TGCAGCGCTT
AAGGCAGGCA TGTCCGGCGA CGAGCATGCC GACTGGAATG AGAAGCCTCA AGCCGAACCG
TTCGGGGCCT ACTACAACAG CTACGCAAAC TCGGCGATGG TCTTTGTCGA CGGTTCGTGG
ACCCTACCTC GGGAAGCTGT TCAAGGGCCG ATTGAAGTTC GGCTTGACGG CCCATTTCCT
GGGCCTGAGC TGACGCAGCG GATGCTGGGC GGGGTCCTAC GAGTAGGGTC TGATACCGGC
CAGGCGCACG CGACGTTGGA AAACCAGATT CAGGAACTTC TGGGCGGCCA GTTGATACAT
GGCCGTTGGT CCAGGCTGTC GACGCCTGTT GCCGTTGACG ATGCCCGAGC CATCTGGGAG
GCTGCCGAAA AAGCCGACAG TAGCAGCACG CCCGACCAGC CCTACGGCCG GCGCAAGATC
CAAATACGGG GAGTGGTTTT CCCGGAAGAG TCCGGACCAA GACAAGTCTC CGAGGGGTGG
CTATTCGTGC TGAGGGTCCG CGGCGAGCAA CGACAGAACT ATAGTAAGAA GCCTCGTTCC
GGCCGTTCCC GACCCTTGAT GGTTTCTGGG GGCGCAGATG AGTACCACCT GCTCCGCGCA
GGCCGTGTTG GCCGTTCCGA CATCAGAGCA CGTGTTCCAG ACCTTGGCCC GTTGGCAACG
AAGAAAGTCG CGCTTGTCGG TGCAGGTGCC ATTGGAAGCG CTGTTGCAGT GCATCTGGGC
CGGGCTGGTG TAGGACACCT GAGCCTCATT GATGCTGACG TTCTCGAACC CGGGAACTTG
GTCAGGCACA CAGCCACCCT CGAAGGCGTA GGTTTTCTCA AGGCTGTCGC CGTGGCCAGA
CTGGTCCGGA TGGCAGCGCC GTACACCGAG GTCATGTACA ACCCTGCCTA TGTCGGAGGT
TCACGCCCGC ATGGAGAGCC GGACCCGCTA CGCCAAATCA GCCAGGATCT GGAATCTTAT
GACCTGTTGA TTGATGCCAG TGCTGATCGT GGTACACAAC GAATCTTGGC CTTGGTCGCT
CGTGAGGTTG ACGTGCCCTA CCTATCGCTC GAGGCCACGA ACGGTGCCTG GGGAGGCTTG
GTGGCTTACA TCGGGCGGGA CTCTCCATGG TGCCAGTCGT GCCTGGAGTG GTTCCGCTAC
GATCAGACCA TCCCGGACCC GGCACATTCC CCTTCCCCCT ACACGCAACC GATCGGATGT
GCGCAACCGA CATTCACCGG CGCCGCGTTC GACTTGGAAG AAGTGAGCCT TCAAGCTGCT
CGAACTGCCT CGGCCCGACT AACTTCCGAC GACCAAGGCG CAAAATCAGG TTTCGATGTC
GCCGTCTTGA AACTCCGAGA CGAAGCAGGT GAACGGACCT TGCCAACATG GACCGGATAC
AACCTTGGCC GGCACCCGCG ATGCGCGGGC CACAGTTGA
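
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from the text. The helper names are hypothetical, and only the first line of the sequence is used as sample input; paste all lines to check the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence in this record, used only as sample input.
first_line = "ATGCCTTTCC TGCCCTGGTG GGAACGCTAT GCCGGGCTCC TGCAGTCAGA AATTTCTTGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

A single line is not expected to reproduce the 60% reported for the whole gene; the figure is only meaningful for the complete sequence.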
 
Protein sequence
MPFLPWWERY AGLLQSEISW LQDLGIACRI DETKRDDHQT LTMELSVPET VTGTAPLELT 
AVFPDFYPLV PPKVFAVDLG MPHHWNPFSN EVCLLGTPSE EWGTNGSLAQ LLKDQLPAAL
KAGMSGDEHA DWNEKPQAEP FGAYYNSYAN SAMVFVDGSW TLPREAVQGP IEVRLDGPFP
GPELTQRMLG GVLRVGSDTG QAHATLENQI QELLGGQLIH GRWSRLSTPV AVDDARAIWE
AAEKADSSST PDQPYGRRKI QIRGVVFPEE SGPRQVSEGW LFVLRVRGEQ RQNYSKKPRS
GRSRPLMVSG GADEYHLLRA GRVGRSDIRA RVPDLGPLAT KKVALVGAGA IGSAVAVHLG
RAGVGHLSLI DADVLEPGNL VRHTATLEGV GFLKAVAVAR LVRMAAPYTE VMYNPAYVGG
SRPHGEPDPL RQISQDLESY DLLIDASADR GTQRILALVA REVDVPYLSL EATNGAWGGL
VAYIGRDSPW CQSCLEWFRY DQTIPDPAHS PSPYTQPIGC AQPTFTGAAF DLEEVSLQAA
RTASARLTSD DQGAKSGFDV AVLKLRDEAG ERTLPTWTGY NLGRHPRCAG HS
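
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons). The function name is hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are NOT converted to M here;
    that is irrelevant for this record, whose sequence begins with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein sequence.
first_line = "ATGCCTTTCC TGCCCTGGTG GGAACGCTAT GCCGGGCTCC TGCAGTCAGA AATTTCTTGG"
print(translate(first_line) == "MPFLPWWERYAGLLQSEISW")
```

Passing the full gene sequence and comparing the result with the protein sequence, after removing its spaces, extends the same check to the whole record.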