Gene Arth_3891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3891 
Symbol 
ID4445092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4381193 
End bp4383304 
Gene Length2112 bp 
Protein Length703 aa 
Translation table11 
GC content68% 
IMG OID639691716 
Producthypothetical protein 
Protein accessionYP_833366 
Protein GI116672433 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAACA ACGATGTCCG CTCGAAAGCA CGAACCAAAC CGCTGGCCAT CGCTGTGGCC 
GCGTCGATGC TCCTGGTGGG AGCGGGTGCG GCCTCCGCGG CACCTGGACC CGCCACGGAA
GCCGGACCGG CGCTGACCGC GCCGGTCCCG GCCGGCCCCG GCGCCCCGGC CGAAAGGGGG
CCGCTGACCA ACACAGCACA CCTGGACTTC CTGATGGATA CTGTCCCGCT CACCCCGGTG
GCGGGCCACA CCACCTACAA GCTGGACGAG ATGCCGTCCG CCCAGGCGCC GTGGACCTAC
GCCGACAAAG AAGCGGACGG AAGCTACGGA CGCGTCGGCG GCGGAGACCT CGACCCCGCC
ACGGGACACT GGACCCAGGG CGCCTACAAC GCCGATGACA TCGCACGTAC CGCGGTGGTC
TATCTGCGGC ACTGGCAGCA GACCGGCGAT CCCGCCAGCA AGGAGCGGGC ATTCCAGGCC
CTGCGCTCGC TGACGTTCCT CCAGACCACG GACGGCCCGA ACGCAGGCAA CGTTGTGCTG
TGGCAGCAGT CGGACGGAAG CCTGACCCCC AGCGCCAAAC CGGTGGAACT GCCGGACCCC
TCCGACTCGG CGGAGTCATA CTGGCTTGCG CGCACCGTAT GGGCACTCGG TGAGGGTTTC
ACGGCGTTCC AGGACGACGA TCCCGAATTC GCCGGTTTCC TGCAGGACCG GCTGCACCTT
GCATTAGGCT CGCTCAACAA ACAGTCCCTT GCAAAGTACG GCACCTATGA CACCGCGGAC
GGCGTGCAGG TGCCGGCATG GCTCATCGCC GGCGGCGCCG ACGCCTCGGC CGAGGCCGTG
CTGGGACTTT CCTCCTATGT GAGCTCCGTT CCGGGCGATG AGTTGGCGGC CACGGCACTG
GACCGTCTTA GCGAAGGCGT GGCCGCAATG TCCTCGGGTT CCGCCAGCCA GTGGCCTTTT
GGCGCCATCA TGCCTTGGAA CAAATCGCAG ACGCTCTGGC ACGCCTGGGG CGGCATGGCC
CCCGCCGCCG TCGCCACCGC CTCCGAAGTG CTCGACCAAC CAGCGCTGCT TGAAGCGGCA
GTCAAGGACA CCGCCCAGTT CACACCCCAA CTCCTCGCAG CGGGCGGACC GGACAACGCC
TGGAGTCCGA CGCCCGGCGA AGCACAAATT GCCTACGGCG TGGACTCCCG GGTCCAAAGT
CTCGTGGCCA CAGCCCGGGC CGCTGATGCG CCGGGCCTGC TGGACCTTGC CGCGGTCACA
GCGGGTTGGT TCTTCGGAGC CAACCCCAGC GGCGCCCCGG CCTACAACCC CGCCACGGGA
ACAGCGATCG ACGGCATAGA GCCGGACGGG CGGGTCAATC CCAACTCCGG TGCGGAATCC
ACCATCCACG CCCTCCTCAC CATGCTGGCG CTGGACGCTG ATCCGGCGCT GAAGTCGGCT
GCCGTGGGGA TAAACCGGAC GGTGTCCACA CGGGGCCTCA GCGTGGTGGA AGCCGAATCG
GGCACCATCA CCGGCGAGGG AACCGTGGTC AGGCCGTCCT CCGCCTGGAC CGGTGAGGCA
AACCTCTCCG GCGGCGCCTA CGTGGACCTG AAGGCCGGTG CCTCCCTGGA CATTCCGCTC
CCGGCCTCCG ACCAGCCCCA TAACATCTAC CCCATCGTGA ATCGGGGGAT AGCCCCGTCC
GGCAGCACCG CCTGGACGTC CGCCAAGGGC CCGCTGGGCA CCACGGAAAA CGGCGGGGCA
GGGGAGCAGG GCATCACGGA TGCCCCGGGC ATCCTCTTCC CCTACTCCCT CAAACGCGCG
CTGTCCGCCG GGCTGACCGC CGTCGTCGGC ACCACCAATG GAAACGTTGC GCTGGACGCA
CTGCTCCTGC AGCCGCAGAT CTCCACGGTG TCGGTGTCCG GGGCCGGCGG CGATTCCACC
CTCTATGTGA GCGCGGCAGG GTCCACAACC GTGAAGGAGG TGGACATTCC GGACGGATGC
ACCATGGTCC AGCGCCAGTA TGATTCTTCC GGTCAGCCGG TCGCAGGCGC GAAGAACGAC
GCCGGCACGC GGTCGGGGCG CGTGACGGTA GCGCCCGGGG GCTTCACCAT GGTGACGTTC
CTGGGGCACT AA
 
Protein sequence
MTNNDVRSKA RTKPLAIAVA ASMLLVGAGA ASAAPGPATE AGPALTAPVP AGPGAPAERG 
PLTNTAHLDF LMDTVPLTPV AGHTTYKLDE MPSAQAPWTY ADKEADGSYG RVGGGDLDPA
TGHWTQGAYN ADDIARTAVV YLRHWQQTGD PASKERAFQA LRSLTFLQTT DGPNAGNVVL
WQQSDGSLTP SAKPVELPDP SDSAESYWLA RTVWALGEGF TAFQDDDPEF AGFLQDRLHL
ALGSLNKQSL AKYGTYDTAD GVQVPAWLIA GGADASAEAV LGLSSYVSSV PGDELAATAL
DRLSEGVAAM SSGSASQWPF GAIMPWNKSQ TLWHAWGGMA PAAVATASEV LDQPALLEAA
VKDTAQFTPQ LLAAGGPDNA WSPTPGEAQI AYGVDSRVQS LVATARAADA PGLLDLAAVT
AGWFFGANPS GAPAYNPATG TAIDGIEPDG RVNPNSGAES TIHALLTMLA LDADPALKSA
AVGINRTVST RGLSVVEAES GTITGEGTVV RPSSAWTGEA NLSGGAYVDL KAGASLDIPL
PASDQPHNIY PIVNRGIAPS GSTAWTSAKG PLGTTENGGA GEQGITDAPG ILFPYSLKRA
LSAGLTAVVG TTNGNVALDA LLLQPQISTV SVSGAGGDST LYVSAAGSTT VKEVDIPDGC
TMVQRQYDSS GQPVAGAKND AGTRSGRVTV APGGFTMVTF LGH