Gene Arth_3914 details

Gene Information

Locus tag: Arth_3914
Symbol:
ID: 4444554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4409812
End bp: 4411830
Gene Length: 2019 bp
Protein Length: 672 aa
Translation table: 11
GC content: 68%
IMG OID: 639691739
Product: NHL repeat-containing protein
Protein accession: YP_833389
Protein GI: 116672456
COG category: [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0526] Thiol-disulfide isomerase and thioredoxins
TIGRFAM ID:
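
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive at both ends (the record does not say so, but the numbers are consistent with it). The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4409812
end_bp = 4411830

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop codon
# and does not contribute an amino acid to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 2019 0 672
```

A remainder of 0 means the gene length is a whole number of codons, as expected for an unspliced CDS.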


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAAA CCGTACGGAC CCACCACCGG GTCCGCGCCT CCGAACTGGT TGGCCGTAAC 
TGGCTCAACA CCGGCGGGAA GTCCCTGGAC TTGGAATCCC TGCGCGGCAA GATTGTGCTC
CTGGACTTCT GGACCTTCTG CTGCATCAAC TGCCTGCACG TACTCGATGA GCTGCGGCCG
CTGGAAGAGA AGTACTCGGA CGTCCTGGTG ACGGTGGGCG TGCACTCCCC GAAGTTCGAG
CACGAGGCCG ACCCCGTTGC CCTCGCCGCC GCCGTGGAGC GCTACGAGAT CCACCACCCG
GTCCTGGACG ACCCCGAACT GGAGACCTGG AAGGCCTACA CCGCCCGCGC CTGGCCCACC
TTGGTGGTCA TCGATCCCGA GGGCTACATC GTGGCGCACC TTTCCGGTGA AGGCCACGCG
GACGGCCTGT CCGTGCTGAT CCCGGAACTT ATCGCCCAGC ATGAGGCCAA GGGCACCCTG
CACCGCGGCG ACGGCCCGTA CGTTGCGCCG GAGCCGACGT CGGGCACCTT GCGCTTCCCC
GGAAAGGCAC TCTTCCTCCC GGCCGGCCGG GGCACCTCCG CCATCGGCGG ACCGGAGGCA
AAAGCGTCCG ACGCCGGGAC TTCCGACGCC GGCTCCTGGC TCGTCACCGA CACCGGCCAC
CACCGCTTGG TGGAGCTGGG CACGGACTTC CACACGGTGC TGCGCACCTA CGGCTCCGGG
ACCAAGGGAC ATGCCGACGG CCCCGGCGCC GACGTCGACT CCGTCAAACC CACCGCACAG
TTCAACGAAC CGCAGGGCCT GGTCCTCCTG CCGGAAGACG TGGCAGCGAA GACGGGCTAC
GACGTCGTAA TTGCCGACTC CGTCAACCAC CGCCTGCGCG GGCTGTCCCT CGCGGACGGA
ACCGTCAGCA CGCTCGTCGG CAGCGGCGTG CAGCGCCTGC TCGAGACAGG GCCGGCGCGG
GTGGACGAGG ATGCGGCTGG CTTCACCGGA CGCCTCAGTG ACCATCCCTT GGACGTTGCC
CTGAGCTCGC CGTGGGACGT TGTCTGGTCG GCCAAGCTCA ATGCCGTGGT TGTTGCCATG
GCCGGCGTCC ACCAGATCTT CAGCTTCGAC CCGATCTCCG GAGCCGTATC AATTCTTGCC
GGTAACGGCC TGGAAGGGCT CCTGGACGGC GCCGCGCACG AAGCCTGGTT CGCCCAGTCG
TCCGGCCTGG CCGAGGACGC TGACGGCAAC ATCTGGGTGG CCGACTCGGA GACCTCCGCG
CTGCGCAAAC TGGTAATTGA CGACGCCGGC ACGGTCACCG TGGAGTCCGC CGTCGGCAAG
GGCCTTTTCG ACTTCGGCTT CCGCGACGGT CCCGCCGCGG AAGCACGTCT GCAGCACCCG
CTCGGCGTCA CCGTTCTGCC GGATGGATCC GTGGCCATCG CCGACACCTA CAACGGTGCC
GTGCGCCGTT ACGACCCCGC CACCGGGACG GTGTCCACGC TCGCCCGCGG GTTGTCCGAG
CCCTCCGACG TGATTGTGGA CCACACGCAC ACAGCCGGCT CGGAGCCGCT GCTGGTGGTC
GTGGAAGCCA ACAAGCACCA GCTGATCTAC GTGCCCATCC CCAAGGAGGC CCAGCAGGTG
GACGAGGGCG CGGTGCAGAC GCACCGGCCC AAGAGTCCGG TCGCTCCCGG CACACTGGAG
CTGACCGTTC GGTTCACGGC ACCCACCGGG CAGAAGCTCG ACGACCGCTG GGGGGACCCC
ACGCAGCTGA AGATTTCCTC AACGCCGCCC GAGTTGCTGG TTGCCGGCGG TGGAACCTCG
GTGGGGCTGC AGCGGACGCT GGAGCTCTCC GCCGACGTTC CCGACGGCGT GCTGCATATT
ACGGCGCGCG CCGCGGCCTG CGACGGCCCC GAAACGGCCG ACGGCGAGAT CCCCGACCAC
GCTGCCTGCC ACCTGTACCA GCAGGACTGG GGCATCCCGG TGGTGCTGCA GGCCGACGGC
GACACTGAGC TGGTCCTGGA CCTCCGCGGC ATGGACTGA
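
The gene sequence above is shown in blocks of ten bases, so the spaces and line breaks must be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction. It assumes GC content means the count of G and C divided by the total number of bases; the record does not state how its GC value was derived. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAGCGAAA CCGTACGGAC CCACCACCGG GTCCGCGCCT CCGAACTGGT TGGCCGTAAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content listed in the gene information.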
 
Protein sequence
MSETVRTHHR VRASELVGRN WLNTGGKSLD LESLRGKIVL LDFWTFCCIN CLHVLDELRP 
LEEKYSDVLV TVGVHSPKFE HEADPVALAA AVERYEIHHP VLDDPELETW KAYTARAWPT
LVVIDPEGYI VAHLSGEGHA DGLSVLIPEL IAQHEAKGTL HRGDGPYVAP EPTSGTLRFP
GKALFLPAGR GTSAIGGPEA KASDAGTSDA GSWLVTDTGH HRLVELGTDF HTVLRTYGSG
TKGHADGPGA DVDSVKPTAQ FNEPQGLVLL PEDVAAKTGY DVVIADSVNH RLRGLSLADG
TVSTLVGSGV QRLLETGPAR VDEDAAGFTG RLSDHPLDVA LSSPWDVVWS AKLNAVVVAM
AGVHQIFSFD PISGAVSILA GNGLEGLLDG AAHEAWFAQS SGLAEDADGN IWVADSETSA
LRKLVIDDAG TVTVESAVGK GLFDFGFRDG PAAEARLQHP LGVTVLPDGS VAIADTYNGA
VRRYDPATGT VSTLARGLSE PSDVIVDHTH TAGSEPLLVV VEANKHQLIY VPIPKEAQQV
DEGAVQTHRP KSPVAPGTLE LTVRFTAPTG QKLDDRWGDP TQLKISSTPP ELLVAGGGTS
VGLQRTLELS ADVPDGVLHI TARAAACDGP ETADGEIPDH AACHLYQQDW GIPVVLQADG
DTELVLDLRG MD
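
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a minimal, dependency-free translator, not the method used by the database. It assumes the amino acid assignments of translation table 11 match the standard genetic code (to my knowledge the two differ only in permitted start codons, and this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied as displayed.
first_line = "ATGAGCGAAA CCGTACGGAC CCACCACCGG GTCCGCGCCT CCGAACTGGT TGGCCGTAAC"
last_line = "GACACTGAGC TGGTCCTGGA CCTCCGCGGC ATGGACTGA"

print(translate(first_line))  # prints: MSETVRTHHRVRASELVGRN
print(translate(last_line))   # prints: DTELVLDLRGMD
```

Both results match the start and end of the protein sequence above. The last line can be translated on its own because every preceding line holds 60 bases, a multiple of three, so each line begins in frame.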