Gene Arth_4222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4222 
Symbol 
ID4443588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008539 
Strand
Start bp55540 
End bp56913 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content58% 
IMG OID639687747 
Producthypothetical protein 
Protein accessionYP_829444 
Protein GI116662391 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACGCG ATGATGATGA GAGCACGCTT GAGTTCATAG TCGACTCTCT TGAGGCCAAC 
GCCCTGGACG AATCCGATGA CTTTGAGTTT GATCCGGAAA CGGCAGAAAT ATTGTCTGAT
CCCGAATCGG ATCCTGGAAT CACCATCGAA GACACCGAGG AGGCCCAAGA CCCTGTGGCT
CAAGCGCTAA AGCCTGGGGC GCCCCTTTTC TTGCTCCATA AGAGCGCCCA CCACTCTGCG
ATCCTTCCTG CAGTGGCAGC CGATATAACA GTCAAGCATG GCGGAGCACT TCACGTTGGC
GTCGCCGCGT CCGTGCCGAG AACTACTCCG CAGGCCCTGA AACGATTCTT CGAGTCAATG
CCTTCCACGA CCCTTCGTTT CGCGGATCCG GAAGCTTTCG CACGACACGA TTCCTTGGGA
CCGTATATTG CCGCTCAACG GGAGGACAAG CCGTTGGTGG GCAGGACTGG CGCTCATTGG
AAATACTTTG GGGACCCACA GGTTGGAGGA AGGAATGCGA CCTGGGTTAA GGATGTATTG
GACGCCCAAC GATCCATGGG CGCGTCTGTC CTGCTGACCC CCGGAGTCTG GGCCGATCCC
ACAAGCGCTC AAACTGCACT TACGGAGGCT CGGCAGCACG CTTCATGGGC GCGCACTGCG
CTGACTCCAG GGGAACACCT TGCGGTAAAC ATTACGCTGT CATGCCAGTG GCTCACCAAT
ATCCACCTGA GAGACAAGCT CCTCAACGAG ATCCTCGACA TGGATGAAGA CGTTTTTTAC
ATCCGCGTCA GGTGGCCCTT GATGCCCCAG ACTTACGGAC AGCTCCTAGA CCAGGCCATT
CTTGATGGTT ATGTCGAGCT TGCCAATGTG TTTGAAGACA ACGACAAAGT GTTGATCCTC
CCTAACACGG GCCTCACCGG GTGGGCAGCA CTTGCTTGGG GAGCCCACGG CTACTCCACT
GGTATCGGCT CCGGCGAGCG AGCCTTTGCT GACACCCGCG TCATCCGGAT GAAAAGGACG
AACCCCCGAC CTGCCCCCAC GAACCGCACG TTCGTTACAG ATATCCTCCA TGTCACTGAC
GTGACCACTG CAACCCAGTT GGATCAATTG GCTGGCGGAG CTTGCCGATG CCGATTCTGC
GCAAGTCAGC GGAAACTCAC TCAGTGGAAC AAGGCACTTG CGGGAGCACA CTATCTGCGG
CAGGTGGCCG ATATTACGGC CACTATCTCA ACAAGCGCTC GAGGCCGCCG GGCGGGCGCC
CGTCGTATCG TTCGGGCCGC AGCCACCCAG GCTGCGACAG CCACGCGGAG AGTGCCCCTA
GCCGCGACTA ACGAACCAAA GCATTTGCCT CTATGGAGCG CCCGTCTGCG CTAG
 
Protein sequence
MSRDDDESTL EFIVDSLEAN ALDESDDFEF DPETAEILSD PESDPGITIE DTEEAQDPVA 
QALKPGAPLF LLHKSAHHSA ILPAVAADIT VKHGGALHVG VAASVPRTTP QALKRFFESM
PSTTLRFADP EAFARHDSLG PYIAAQREDK PLVGRTGAHW KYFGDPQVGG RNATWVKDVL
DAQRSMGASV LLTPGVWADP TSAQTALTEA RQHASWARTA LTPGEHLAVN ITLSCQWLTN
IHLRDKLLNE ILDMDEDVFY IRVRWPLMPQ TYGQLLDQAI LDGYVELANV FEDNDKVLIL
PNTGLTGWAA LAWGAHGYST GIGSGERAFA DTRVIRMKRT NPRPAPTNRT FVTDILHVTD
VTTATQLDQL AGGACRCRFC ASQRKLTQWN KALAGAHYLR QVADITATIS TSARGRRAGA
RRIVRAAATQ AATATRRVPL AATNEPKHLP LWSARLR