Gene Arth_4337 details

Gene Information

Locus tag: Arth_4337
Symbol:
ID: 4443483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008538
Strand:
Start bp: 76144
End bp: 77301
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 62%
IMG OID: 639687658
Product: hypothetical protein
Protein accession: YP_829355
Protein GI: 116662301
COG category: [R] General function prediction only
COG ID: [COG4469] Competence protein
TIGRFAM ID:
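
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon; neither assumption is stated in the record itself, but both are consistent with the reported values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 76144
end_bp = 77301

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the reported Gene Length (1158 bp) and Protein Length (385 aa).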


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.478292
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAGCG AATCAACGTC GGAGCGCCGG GATATCTACG CGGTCCTTGG TTCCCCTGAC 
TCGCGCTTTC CCGTCGAAGC GCCGGCAGAT CCGAAGGAAG CCCGCCGGCT GAAGGGCGGA
AACAGTTTCT ACTGCTCGAC CGCACTTGGC GGGTGCGGAG GGGAACTGAC CTTTGCCATC
GGCGACGTCA ACATTCCCCA CTTCAGGCAC CAGGCGGGAA GCAGATGTTC ACTGATCTCG
TCCAAGACCC TGGCCGACCG CTACACCCAC CTTGCCATCC AAGAAGCATT GCGGGCGTGG
ATCGAAACCA TGCCGGGCTT CTCCTGCCGC CTTGAGGTCT CCATAGAAAG CGGACGCACG
GACGTCCTGG TGACAGGTCC GTCCTTCGAG GTTGCCCTCG AAGTGCAGCG TTCAGCGCTG
TCTGCCCGCA ACGCCCTGGA ACGGACGGCG GTCTACAGCC ATAGAGCCAA CGCAGTGCAA
TGGCTGTACG CATTCAGGGA CATCGATGCC TACAAGGCGG AGCTTGCCGA CCGCGGATGG
AGCCTGAGGA TCTGGTACGG GTGGGCCAAG AAGGAATGCA GGATCGGCGT CAGCTACGAA
ACCGAAACCG GCGCGGAGGT GGAAATAAAG GAAGCCGGCG GACCGCTGAC AGACTGGGAC
ATATCCTTTC GTGGTTTGGA CTCGGTCCAC CTCCGCAAGG CAAAAGCGGC AGTGGAGCGC
CTGAGAGCGA CGGAGCGGGA GCGACGCCTG GAGCAGGCCA GGGAGGAGGC TGCCCGCGAA
GCCGCGGAAA AAGCGCGTAA AGAGGCCGCG CGGCTCCGTC ACATCGCAGA CCAGAGAGCA
GCGCACGAGT CTCTTCTCAG GGCCTTGCAG CACACCCCGG AGGGGCTGGA AAACAAATGG
CCATCGTCGT GGCCCCAGCT TAAAGGCAGC CCGGGACAGG TCTCATGGGC AGAATCGATT
CGTGCCCGGG CCGTCGCTTT GTTGCGTGAA GAATTGGTCG AGGAGTGGCT TCCCCAAGCC
AGGGGAGTAC CGGTTGCGCG GTGGCTGGCT TTACAATCAT CGGCGGCATT CTGGATTCAT
TGCCGCTTCA ATGACACCTT TGCTTTTGTT CAAGCGTACG AGCACCAATT CGGATCCCCG
TGGCACCCCC AACGTTGA
 
Protein sequence
MDSESTSERR DIYAVLGSPD SRFPVEAPAD PKEARRLKGG NSFYCSTALG GCGGELTFAI 
GDVNIPHFRH QAGSRCSLIS SKTLADRYTH LAIQEALRAW IETMPGFSCR LEVSIESGRT
DVLVTGPSFE VALEVQRSAL SARNALERTA VYSHRANAVQ WLYAFRDIDA YKAELADRGW
SLRIWYGWAK KECRIGVSYE TETGAEVEIK EAGGPLTDWD ISFRGLDSVH LRKAKAAVER
LRATERERRL EQAREEAARE AAEKARKEAA RLRHIADQRA AHESLLRALQ HTPEGLENKW
PSSWPQLKGS PGQVSWAESI RARAVALLRE ELVEEWLPQA RGVPVARWLA LQSSAAFWIH
CRFNDTFAFV QAYEHQFGSP WHPQR
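
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is not the database's own pipeline; it is a hand-built codon lookup, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGGACAGCG AATCAACGTC GGAGCGCCGG"))
print(translate("CAACGTTGA"))
```

The first call reproduces the opening residues of the protein sequence (MDSESTSERR), and the second reproduces its final two residues (QR) before the TGA stop codon.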