Gene Arth_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3940 
Symbol 
ID4444815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4450755 
End bp4454219 
Gene Length3465 bp 
Protein Length1154 aa 
Translation table11 
GC content67% 
IMG OID639691771 
Productputative ATP-binding protein 
Protein accessionYP_833415 
Protein GI116672482 
COG category[S] Function unknown 
COG ID[COG4913] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCATCG CAACCATGCT GCCGATGGGC GAGCTCACCA ACCCCGGCCA GATGCGTCTC 
GCGCTGGTGC AGGTGGTCAA CTGGGGCACG TTCCATGGCG CCCACACGAT GCACGTGGAC
CGTAACGGCA CCCTGCTGAC GGGCAACTCC GGAGTGGGCA AGTCCACGCT GTTTGACGCG
ATGCTGCGTG TGTTCGATGC GCGGCCGCGC TCCAACGAGG CCGCGGCGCA GCGCTCCGGC
GCTGTGGAGG ACAAGCGGAC AACGTTCACG TACATGCGCG GCAAGGTGGG CGACAAGGCT
GTGGGTGAGG GCTCTGCCAG TGCCTTCCAG CGCCCCGGCG CCACGTGGTC CGCCGTCGCC
CTAACGTTCG ACAACGCGGC CGGCACGAAG GTGACGGTGT CGGCGCTCTT CGACCTGCCC
AAGAACGGCA CGGAGTCGAG TGTCGGCCGG TATTACCTGA TCGACAACAA GCCGCTGGAC
CTTGCGGCGA TCGAGGGCAT CGCCGAGAAG CGCTTCACCA AGGCTGCGCT GGACACGATC
TTCCCGGACG CGCAGGTCTT CGACGTGCAC AAGGCGTTTG CCGAGCGGTT CCGGCGCCTG
CTGGGCATCA ATTCGGACCA GGCCCTGCCG TTGCTCCGGG TGATCCAGGC CGGCAAGGGC
CTCGGCGGCA GCGTCAACAC CTTCTTCCGG GACCAGGTGC TTGATGCTCC CGCCACGCTG
TCCGCAGCGG ATGACGTCGT CGAGGAATTC AGCAACCTGA TGTCCATCCG GCAGCGGCTG
GAGGACGTGC GGCAGCAGCG CGACCAGCTG GCTCCGGTTC CGGGCCTCAA CAAGGAATAC
GCGCAGTCCC TCCTGGATGC GAACCGCCTG CGGGAACTCG CCGGTGAAGA ATTCGATGCC
TACAAGCAGC AGCTCGCGGT GACGGTCCAC GAAAAGACCC TGGTCCGTTA CAAGGACCTC
GCCCGGGCCA AGGCTAAGGA CCTTGCCGCG GAGCGCGCGG TCCGGGACGG CCTGGCGAAG
GACCTGCGCC AGCTCGAGGC GGACTATAAC AACCAGGGCG GCAACGCGAT CTCGGCGATC
GAGCAGTCGC TGGAGAATGC CCGGGTGGGA CTCAAGCTCC GCCAGCAGGT GGAGGAAGCC
GCGCGGCAGG CGTTGGCCGA TGCCGGGCTG GAGCTCGCGT GGTCGGCCGA AGGCTGGGAA
CAGGCGCACA CCCACGCAGC CGACCGCTCG GCGGAACTGC AGGGTGACTC CGAGGCGCTC
AAAGAGCTGC GCTTCGAGGC ATTCGACGGC CACGCCACGA AGAAACGTGA GCTTGCGGCG
GCCCAGCAGG AACTCGTCTC GCTGAAGACG CGCAAGTCCC TGCTCCCGCC GTCGAGCATC
GAAAACCGGA CCGCCATCGC GGCGGCAACC GGCGTGCCGG AGGAGCAGAT GCCGTTCGGC
GGCGAACTGA TCGACGTCGC CGAAGGGGAG GAACAATGGC GGCCCGCCGC TGAACGCGCG
CTGCGCAACC TGGCCACCAC GCTGCTGGTT CCCGGCGAGC ACTTCGCCGC CGTCACCCGC
TACCTGAACA ACAACTCCGT CCGCGGCGCG CTCCGTGCCG TGGACGTGTC TAAGCCGCTC
GCTGGCGGCG CGCTGGCCGT TGAAAACGCG GCCGACGGCG ATCTGCTGAC CAAGCTTGAC
ATCCTCACCA CCGGGCCGGC GGCCGACGCC GGGCAGTGGA TCCGCGAGCG GATCGCGGTC
GACTTCGCCT ACCCATGCGT CGAGGACCCG GACGAACTCG CTGCGCTGGA CAAAGGCCTG
AGCCTGGGCG GCGTGGTCAA GCGCAACCGC CACACGGTGG AGAAGGACGA CCGCTTTGCC
AGCAGGCAGG ACTACGTCCT GGGCTTCGAC AACGCCTCGA AGCTCGAGCT GGTGGCTGCA
CAGGTGGAGG ACCTGCAGCA GGAACTGGCG AAGGCCGCCG AACTCGCGCA AAGCCGCGAG
GAGTCGCACC AGGGAATGAC CCGCCAGCTG GAGGCCCTAC GCCGGATCGC CGAGGACAAC
CGGCCCTGGG AGCAGGTTTC CGCGGCTGTC GCGGAGGACG AGCTCGGGAA GATCGAGCAG
CGGCTCAAGG ATGCCCTCGC CGCCCAGGCC GACCTGGAGC CGTTGCGCGC CAACATCGAG
GCGGCCCGGC AGAAGCACCA GTCCAGCACC GAGGCCGCAG CGGTCCTGCA AAGCGAATAC
AAGGCGCTGG ACCACCAGCT CACCGCGGCG GATTCGCTGC TGGAAGCCGC GCGCACCCGT
CTGCGCCAGG CACCGCCGTC GGACGCCACC GTCGCCGCGC TGGAACAATA CTGTGCCGAC
TTCGGCGGCG TGGACGACGT CGCGGAAATG CATGAGCTGG ACAACCTGGC CCACCAGGTC
CGGACCCGGC TCCTCGCAGA ACTGCACGCT GCAGAGTCCC GCGGCCAGGC CACCTCGGAG
CGCCTCACCC GTATCTTCGA AGGATTCGTC CGCGAATGGG GCACGGCGAT CTCCGCGGAC
CACGGCACCT CGATCGGCGC CGCCGGGGAG TTCGAAGCCC GGTACCATGC GATCGTCAGC
GACGGCCTGC CCGCGCAGGA AGCCGAGTTC CGGCAGTTCT TCAACCAGCG CACGCACGAA
TCGTTCAGCA CCCTGCTGCA CCTGCTGGAC GAGGAACGCC GGTCCATTAC CAGCCGCATC
CTGCCCCTGA ACGGCATCCT GTCCGAGGTC AACTTCCACG AGGGCAGCTT CCTGGAACTC
GATATCAAAC AGACCCTGCC GCCCACCGCG AAGCAGTTCA AGGACGCCAT CCAGAATGCG
CTCAGGACGC GTCACACGCG GCCCTCGCGC GCTGCGGGAG CGACAGCTGC CGGCGCCGAA
ACGGACGACG ACGCCGAGCT CACCAACCGC TACAAGTCGC TGGAAACGCT CGTGAAGCGA
CTGGGGTCGC AGACGCCGGA GGACCGGCGC TGGCGGGCCG AGGTGCTGGA TGTGCGCGGG
CACCTGTTCA TCCAGTGCAA GGAGCACCGC GAAGTACTTG GTCCGCGCGG CGGCAAGCGG
ACGGACGTGT TCATGCACGC GGATACGGGT TCCATGTCCG GCGGCGAGCG GCAGCGCTTC
ACGGCGTTCA TCATGGCCGC GGCGCTGAGC TATCAGCTGG GCATCGCGGA GCAGGGCTTC
ACCACCTACG GCACCGTGAT GATGGACGAG GCATTTGTCC TTGCCTCGGA GGAATTCGCC
GGTGCGGGCA TCAAGGCGCT GCATGAATTC GGCTTCCAGC TGCTCCTGGC CGCTCCGGAG
AATGTGATTG ACCTGTCCAA GCACCTGGGC TCCGTCACGG AAATCCTGCG GGACAAGCGC
ACCAACCGCT CCGGAGTCCT CACAGCTCCC GTGATCGGGC CGCGGGCGGG CGCTGAAGGC
CAGTGGAGGT CCGAGGCGAA CCCGGTGGAT ATCGTCCTGC GCTAA
 
Protein sequence
MSIATMLPMG ELTNPGQMRL ALVQVVNWGT FHGAHTMHVD RNGTLLTGNS GVGKSTLFDA 
MLRVFDARPR SNEAAAQRSG AVEDKRTTFT YMRGKVGDKA VGEGSASAFQ RPGATWSAVA
LTFDNAAGTK VTVSALFDLP KNGTESSVGR YYLIDNKPLD LAAIEGIAEK RFTKAALDTI
FPDAQVFDVH KAFAERFRRL LGINSDQALP LLRVIQAGKG LGGSVNTFFR DQVLDAPATL
SAADDVVEEF SNLMSIRQRL EDVRQQRDQL APVPGLNKEY AQSLLDANRL RELAGEEFDA
YKQQLAVTVH EKTLVRYKDL ARAKAKDLAA ERAVRDGLAK DLRQLEADYN NQGGNAISAI
EQSLENARVG LKLRQQVEEA ARQALADAGL ELAWSAEGWE QAHTHAADRS AELQGDSEAL
KELRFEAFDG HATKKRELAA AQQELVSLKT RKSLLPPSSI ENRTAIAAAT GVPEEQMPFG
GELIDVAEGE EQWRPAAERA LRNLATTLLV PGEHFAAVTR YLNNNSVRGA LRAVDVSKPL
AGGALAVENA ADGDLLTKLD ILTTGPAADA GQWIRERIAV DFAYPCVEDP DELAALDKGL
SLGGVVKRNR HTVEKDDRFA SRQDYVLGFD NASKLELVAA QVEDLQQELA KAAELAQSRE
ESHQGMTRQL EALRRIAEDN RPWEQVSAAV AEDELGKIEQ RLKDALAAQA DLEPLRANIE
AARQKHQSST EAAAVLQSEY KALDHQLTAA DSLLEAARTR LRQAPPSDAT VAALEQYCAD
FGGVDDVAEM HELDNLAHQV RTRLLAELHA AESRGQATSE RLTRIFEGFV REWGTAISAD
HGTSIGAAGE FEARYHAIVS DGLPAQEAEF RQFFNQRTHE SFSTLLHLLD EERRSITSRI
LPLNGILSEV NFHEGSFLEL DIKQTLPPTA KQFKDAIQNA LRTRHTRPSR AAGATAAGAE
TDDDAELTNR YKSLETLVKR LGSQTPEDRR WRAEVLDVRG HLFIQCKEHR EVLGPRGGKR
TDVFMHADTG SMSGGERQRF TAFIMAAALS YQLGIAEQGF TTYGTVMMDE AFVLASEEFA
GAGIKALHEF GFQLLLAAPE NVIDLSKHLG SVTEILRDKR TNRSGVLTAP VIGPRAGAEG
QWRSEANPVD IVLR