Gene Arth_4271 details


Gene Information

Locus tag: Arth_4271
Symbol:
ID: 4443439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008538
Strand:
Start bp: 2355
End bp: 4136
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 57%
IMG OID: 639687592
Product: hypothetical protein
Protein accession: YP_829289
Protein GI: 116662235
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCCGC GCACCATCGA TCCCCGCATC AGCCTCGCAC TGTCCGTACA AGCCCAGCCC 
GGTGTTTATG CCCTCCTTAT CGGTTCCGGA ACTTCCACCG GGGCCGGGAT CCCCACAGGC
TGGGGAGTCA TCAAGGACTT GGTGCGCCAA GCCGCAGCAG CAGAGGGCAC AACACTGAGT
GCTGACCCCG CCGACGAGGA AATAGATGAA TGGTGGGTTA ACCATGGTGA TGGCAACGAG
CTCGGATACT CCGGGCTCCT TGAATCACTA GGCCGAACGC CTGCCGCCCG CAGCGCACTC
CTGCACAGCT ACTTCGAACC AAACGACGAG GATAGAGCGG ACAACCGCAA GGTGCCCGGT
AAGGCACACC ACGCAATTGC TGAACTCGTG CAACGAGGAG CCATACGTGT TATCCTCACC
ACCAATTTCG ACAGCCTCAT CGAGCAAGCA CTAGATCAGG CGTCCGTTCC GTACCAAGTG
CTGTCCTCAG AGAGTGCTAT CAAAGCGCGA AAGCCACTGC ACCATGCTGA CTGCACTGTC
ATAAAGCTTC ACGGGGATTA CAAGTCACTT GACCAGAAAA ACACGCTGGC CGAGTTGACC
GACTACGGGC AGGCAACACG TGAAATTCTC CACGAGGTCA TGGAAAACTT CGGGCTGATC
ATTAACGGGT GGTCAGCCGA CTGGGACAAA GCATTGGTAG AGGCGCTGGA AGGCCGCCAG
AGTCGCCGCT ATCCGTTGTA CTGGACTACG TTGTATGGCC CAGGGCCTGC GGCCGCCGCG
CTGATCGAAC AACATGGAGC CGCCGTTATC AGCGGCGTCA CCGCCGACGA ATTCTTCCCA
GACCTGCAGC AACGGCTCGA ATCCCTAGAT TCTCTTGCCG CACCGCAGCT AACTGAAGAT
ATGGCAATCG CCCGCCTCAA GAGGCTCCTT CCGTACAGGG AGTCCTACAT CGAGATACGT
GAACTCTTGA CCAGTGAAAT CAGAACGCTG GCCTCGTACA TCCGTGAGCG CGGGGGTTCC
TTTCCTCCTG GTGGAGATTA CGCAACTGCC TTCGATAATG AATGTTTGTC CCTCCGTAAC
CGCTCGCAAA CTCTGATTCG TCTAATCGCG ACCGGTGTGG CATTTGACCG TGACCGCATT
CACGGTGACC TTTGGGTGTG GGCGGTTCAG CAGCTCATGA AGGCGAGAGG GCAGGTTTCG
TCCTTTCAGG AAGGCTGGTT CAATTTGGCC CATTACCCTG CCCTATTGGC TCTACGGGCC
ATAGCGATGA TTGCTGTCAC CGAAGACCGC GAGGACGTGT TTATCCGAGC GGCGAGTGAG
CCGAAGTGGA AGGATGCCTA TTCCGGCCGC GATCCCGAAC CTGCCTTCCT GGTTCTGCAG
GACGAGAGGG TTGTTTCCTA CGACTTAGCA AAAGCAGCGC CGCGATGGAA CGGGACGCAG
TGGATGTATC CGCAAAGCGA GCTGATTTCA GATGACATGC AAGCCCTGAT AGGCCATCTG
GTTGGATCTG GCGACGATTA CAAGAAGGCC TTTTGTCAGG CCGAGTACCG CATGGCGTTA
GCTCACGTGT TCCTTACTAC TCGATCAAGC CGTCCCTCGG CAGGCAAGTA TTGCTATGCA
GCCACGCGTG GTGGCGACAA GAACATGTGG CAAAAGGACT TCGAACTCAA CGGCGACCGT
CAGGCGTGGC GTTGGTTACC GTCCCCTGAT GGAGAAGCAG ATCCTTTCGC CACAAAACTT
GACGAACTCG CCACGGTCCT AGCCAGGCTG GAGCGCTGGT AA
 
Protein sequence
MTPRTIDPRI SLALSVQAQP GVYALLIGSG TSTGAGIPTG WGVIKDLVRQ AAAAEGTTLS 
ADPADEEIDE WWVNHGDGNE LGYSGLLESL GRTPAARSAL LHSYFEPNDE DRADNRKVPG
KAHHAIAELV QRGAIRVILT TNFDSLIEQA LDQASVPYQV LSSESAIKAR KPLHHADCTV
IKLHGDYKSL DQKNTLAELT DYGQATREIL HEVMENFGLI INGWSADWDK ALVEALEGRQ
SRRYPLYWTT LYGPGPAAAA LIEQHGAAVI SGVTADEFFP DLQQRLESLD SLAAPQLTED
MAIARLKRLL PYRESYIEIR ELLTSEIRTL ASYIRERGGS FPPGGDYATA FDNECLSLRN
RSQTLIRLIA TGVAFDRDRI HGDLWVWAVQ QLMKARGQVS SFQEGWFNLA HYPALLALRA
IAMIAVTEDR EDVFIRAASE PKWKDAYSGR DPEPAFLVLQ DERVVSYDLA KAAPRWNGTQ
WMYPQSELIS DDMQALIGHL VGSGDDYKKA FCQAEYRMAL AHVFLTTRSS RPSAGKYCYA
ATRGGDKNMW QKDFELNGDR QAWRWLPSPD GEADPFATKL DELATVLARL ERW
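
Consistency check

The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAA). The helper names gene_length, protein_length and gc_percent are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring spaces, line breaks and non-ACGT symbols."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record: Start bp 2355, End bp 4136.
length = gene_length(2355, 4136)
print(length, protein_length(length))  # 1782 593
```

Both results agree with the Gene Length (1782 bp) and Protein Length (593 aa) fields. Passing the gene sequence text to gc_percent gives a value that can be compared against the GC content field.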