Gene Arth_3784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3784 
Symbol 
ID4447834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4267394 
End bp4269055 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content65% 
IMG OID639691608 
Producthypothetical protein 
Protein accessionYP_833259 
Protein GI116672326 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATGG CACCCCCGGG GGCAACATGG CACGATATGG ACCAGCCACC ACCTCGCAAC 
TGCATCATTC TCTGGGGAGA CATCATGACT TTTTACGGTG CTGACGTCAG TCAGCTTCGC
GCCTTGGCCA AGGCCGCAGA CAAAGCAGCT TCACTGCTCA GCACCAGGGC GTCCTCGCTG
CAGAGCCAGA TCCTTGCCGC CCCCTGGAAA GGCGGCGACG GCGAGCGTTT CCGGCAGGAG
TGGACGGGCA GCCACCGTCC CAGCATCGAA CGGGTGGTTG CAAGCCTCCG GCAGAACTCT
AAAGTCCTGC TCCAGAACGC CGCGGAGCAG GAAAAGTCCT CGGCGGCCGG CACGGGCGCC
ACCAGCGCAG GCCCGGGCGG GCTGAAGGGC CTGTTCGACC AGATAAAGAA CTGGGCGCAG
GAAAAGCTTG AGGCTGCCCG CGAGGCGGCG GAACACCGGG CCGAGCTTCA GGGCCAGCTG
GACACGATGG CCGGGGCCAG CCCGGAGGAA CAGGCCAAGT GGTGGGACGG TCTCTCGGCC
GCCGACAAGA AGTACCTGAT CGAGGGCGAA GGCCCGGACG GTCCGCTGGC CAAGGACCTG
ATGGCCATGG ACGGCGGAAT CCCGGAAGCA GCGCAGGACC TCGCCCGGCA GCACCTGCTG
GAGCTGGCCA AGGAAGACAT CCCGGTCTAC ACGGAGACGG GCAAGGCCTC CATCGAAGCC
CGTGTCCTGT GGGTCCACGG CGGCGCCGAA GTGGGCACCG AAGTGGTGGA AAACGCGGAC
GGCTCGGCCA CCGTGAAGGT CTACGGCAAC ATGGGCCTGG GCGTGAACGA CGCTTCCGGG
ACCGCCGGGG CCACCCTGAC GGGCGAGGTG GCCCGCGAAT ACACGTTCGG CAGCATTGAA
GAAGCCATGG CGGCGCGGAA CCAGATGTAC GCCGACCTCC CGCCGGACAG TGCGGGTGAC
ATCAAGGACG TGGCCGGCAA CGCGCCCGAC TACATCCTGG ACACCCTGGA CGAGGCCGCC
TCGGACAACG GCTCCAACGG ACACGAGGAC AAGGCCAAGG GATCGCTGAG CTTTGAAGCC
GGTGCCGATT CGGCTGATGC CGCCGCCTCC GCGAAGCTGG AACTGGCCTA TGAAAAGAAC
CTCAGCGACG GGACGTCCAA GGGGAGCGTG GAAGTCTCCG CGAAGGGCAA CCTGGACCTG
GACGGGCGGA CGTTCGAGGC TTCGGGCAAG GGTGGCCTGG AACTGAACAT GGACAAGGAC
AACAACCTCA GTTCGGTGTC CCTGTCCATG GAAGGTACCG TGGCGCAGGG CGTCAAGGAG
GGTATGGACG TCAAGGCCGG CAACGTCGAG TCCAGCGTGA CGGCGGGCAC CCAGGGCACC
GTCAAGATCG ACATCGACTA CACGCCGGAA AACAGCGCCG TGATCGACAG CTACATGAAG
AACGTGGCGC TCGGGAACGA CGCCGCAGCT GCCCGGGACG CCGCGAAGCT CTACGAGGCG
GGCTCGGCTA CGGTACAGGT GAACAGCGTG GTCACTGCGT CCAACGAGGC CGGGTTCGAC
GTCAAGGCAG GCGAGGTCAA GGTCAGCACC GAAAACCAGG TCAGCACGAA CGTCAGCACG
TACCAGAAGG TCCCGAACGA CACCAAGCTC TCCCGGCTGT AG
 
Protein sequence
MPMAPPGATW HDMDQPPPRN CIILWGDIMT FYGADVSQLR ALAKAADKAA SLLSTRASSL 
QSQILAAPWK GGDGERFRQE WTGSHRPSIE RVVASLRQNS KVLLQNAAEQ EKSSAAGTGA
TSAGPGGLKG LFDQIKNWAQ EKLEAAREAA EHRAELQGQL DTMAGASPEE QAKWWDGLSA
ADKKYLIEGE GPDGPLAKDL MAMDGGIPEA AQDLARQHLL ELAKEDIPVY TETGKASIEA
RVLWVHGGAE VGTEVVENAD GSATVKVYGN MGLGVNDASG TAGATLTGEV AREYTFGSIE
EAMAARNQMY ADLPPDSAGD IKDVAGNAPD YILDTLDEAA SDNGSNGHED KAKGSLSFEA
GADSADAAAS AKLELAYEKN LSDGTSKGSV EVSAKGNLDL DGRTFEASGK GGLELNMDKD
NNLSSVSLSM EGTVAQGVKE GMDVKAGNVE SSVTAGTQGT VKIDIDYTPE NSAVIDSYMK
NVALGNDAAA ARDAAKLYEA GSATVQVNSV VTASNEAGFD VKAGEVKVST ENQVSTNVST
YQKVPNDTKL SRL