Gene Arth_3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3233 
Symbol 
ID4444014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3641760 
End bp3644198 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content70% 
IMG OID639691057 
Producthypothetical protein 
Protein accessionYP_832709 
Protein GI116671776 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACAG TGCACGGCGG GCCGTCCCGG TTGGACGGCC CGCCGGCCTC TTACGCGCCC 
ACCACCGCCG GACGCCAGCC GGAAGACTCC GGGCCCGCTG TTGATTTGCC CGACGTTCCC
TTCCGGGCCG ACGGCATCGA GCTGATCGGC GAAACACAGG GATCCGGCTA CCGCGAGCCG
CCCTCGCTGG TGAGGCGCGC CGACGGGCAG GCCATCCAGC TCACCCGCCT GCTTTACTTG
GTACTCGAGG CGATCGACGG CAACCGCAGC GTCGACGAGG TTGCAGAGCA TGCCAGCGCC
CGCTTCGGCA GGCTGGTCAG CCCGGACAAC GTCCGCACGT TGATCAGCTC GCAGCTGCTG
CCCCTGGGAC TGCTCCGGCT GGCCGACGGT TCGCAGCCGG AGGTCAGGAA AGCCGACCCG
CTGCTGGGGA TGCGCTTCCG CTACACCGTC ACCGATCCGG ACCGCACGCG GAAACTGACC
GCCCCGTTTG CAGCGCTCTT CAATCCGCTC ATCATCGTGG CGGTGTGCGC AGCGTTCCTC
GCTTCCTGCT GGTGGGTGCT GATGGTCAAG GGACTCGGCT CCGCCACGCA CGACGCCTTC
GCCAACCCGG CCCTGGTGCT GCTGGTCCTG GCCGTCACCG TTTTGTCCGC CGGCTTCCAC
GAGTTCGGCC ATGCCGCCGC CGCACGCCGT GGCGGTGCCA CGCCGGGAGC GATGGGCGCC
GGCCTCTACC TGATCTGGCC CGCTTTCTTC ACCGACGTCA CCGACTCCTA CCGGCTGGGC
CGCGGCGGCC GGATCCGCAC GGACCTTGGC GGACTGTATT TCAACGCGAT CGTGGCCGTG
GCCATCATGG GTGTCTGGTG GGCCACCGGT TTCGACGCGC TGCTGCTGGT GGTGGTCACC
CAGATCCTGC AGATGGTCCG GCAGCTCCTC CCCCTGGTCA GATTCGACGG CTACCACATC
CTGGCCGACG CCACCGGGGT CCCGGACCTC TTCCAGCGCA TCAAGCCGAC CCTGTTGGGA
CTGCTGCCCT GGCGCCGGTC GGACCCGGAG GCCCAGGTGC TCAAGCCTTG GGCCCGGGCC
GTGGTGACCA TCTGGGTGCT GGTCACCGTG CCGCTGCTGT TGTTCAGCCT GGCAATGATG
GTGATCTCAC TGCCGCGGCT TCTGGGCACG GCGTGGGCCA GCGTGCTCAA ACAACAGTCC
CAGCTGACCG ACAGCCTCGC TGCCGGGGAC GTCGCCGGCG CCGCCGTCCG CGCCCTGGCG
ATCGCCGCCG TCGCGCTGCC CGTGGTGGGC ATCTTCTACG TCCTGCTGCG CCTGGTCCGT
CAGCTGACCA CGGGGCTCTG GCAGAAGACC CGCGGCAAGG CAATCCAGCG CGGGGTCGCG
ATGGCCGCCG TCGCTGCTGT GACCGCCGGC CTGGCCTGGG CGTGGTGGCC CGGAGCGGAC
ACGTACCGGC CGGTGCAGCC GTACGAACGC GGCACCCTGG CTGACGTTAC GACGGCGGTG
TTCCCCACGG CGTCGTCCAC AACGCTTCGG GAAGGACGCG CGGGAAAGAC TGTGGCACTG
TGGCCTGCGG GTGCAGCCAA GCCCACGAGG GAACAGCCCC AGCTGAGCAT GGTGATGGTG
CCCCGCACGG GTCCAGCCGC CGCCGGCACC CCGGACGCCG GCAGCGGTGC CGCCGCACCG
CCGTCGTGGG TGTTCCCGTT CAACCAGCCG GCCGCACCCG AAGAGGGCGA CAACCAGGCG
CTGGCGGTCA ACACGCAGGA CGGCTCGGTG GTGTACGACG TCGCCTTCGC GCTCGTCTGG
GCCGAGGACG GCGAACCGGT GGACACCACC AACGAGGCCT ACGCCTTCGC CAGCTGCTCC
GACTGCGCCG CGGTGGCAGT GGGTTTCCAG GTGGTGCTGA TCGTGGGCCA GGCGGATGTG
ATTGTTCCGG AGAACCTGTC CGCAGCCGCG AACTACAACT GCGTCCGGTG CCTCACGTAT
GCGCTGGCCA ACCAGCTGGT GCTCACGCTG GACGGACCGC TCAGCGGTGA CGGCATGGCC
CGGCTCAACG CGCTGTGGGC CGAGATTGCC GAATTCGGGC GGAACCTGCA GAACGTTCCG
CTGTCCGAAA TCCAGGGACG CCTCGAAGGA TTCAAGGAGC AGGTCATGGA GATCGTCCGG
AACGACCCCA GCGCCACCAA GGGCGCCGCG ACGTCCGCGA CACCAAGCTC CACGGCTACC
GCGACCCCCG GATCCAGCCA GGCCCCCTCA CCCGGAGCCA CGGCGGCGCC AACGGTCCCC
GCAGGAGCGA CGACGGCGGA TCCGGCGCCC GCTGCTCCCG CCACCGGAGG TGCGGCAACG
GAGACACCAG CTGCGACTGC GGAGCCGACG ATCACGCCGA CCGTGACGCC GACGGAACCG
GCACTGGCCA CACCTGGACC CACGTCGAAC GGCGAGTAA
 
Protein sequence
MSTVHGGPSR LDGPPASYAP TTAGRQPEDS GPAVDLPDVP FRADGIELIG ETQGSGYREP 
PSLVRRADGQ AIQLTRLLYL VLEAIDGNRS VDEVAEHASA RFGRLVSPDN VRTLISSQLL
PLGLLRLADG SQPEVRKADP LLGMRFRYTV TDPDRTRKLT APFAALFNPL IIVAVCAAFL
ASCWWVLMVK GLGSATHDAF ANPALVLLVL AVTVLSAGFH EFGHAAAARR GGATPGAMGA
GLYLIWPAFF TDVTDSYRLG RGGRIRTDLG GLYFNAIVAV AIMGVWWATG FDALLLVVVT
QILQMVRQLL PLVRFDGYHI LADATGVPDL FQRIKPTLLG LLPWRRSDPE AQVLKPWARA
VVTIWVLVTV PLLLFSLAMM VISLPRLLGT AWASVLKQQS QLTDSLAAGD VAGAAVRALA
IAAVALPVVG IFYVLLRLVR QLTTGLWQKT RGKAIQRGVA MAAVAAVTAG LAWAWWPGAD
TYRPVQPYER GTLADVTTAV FPTASSTTLR EGRAGKTVAL WPAGAAKPTR EQPQLSMVMV
PRTGPAAAGT PDAGSGAAAP PSWVFPFNQP AAPEEGDNQA LAVNTQDGSV VYDVAFALVW
AEDGEPVDTT NEAYAFASCS DCAAVAVGFQ VVLIVGQADV IVPENLSAAA NYNCVRCLTY
ALANQLVLTL DGPLSGDGMA RLNALWAEIA EFGRNLQNVP LSEIQGRLEG FKEQVMEIVR
NDPSATKGAA TSATPSSTAT ATPGSSQAPS PGATAAPTVP AGATTADPAP AAPATGGAAT
ETPAATAEPT ITPTVTPTEP ALATPGPTSN GE