Gene Arth_4288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4288 
Symbol 
ID4443539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp20174 
End bp22039 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content62% 
IMG OID639687609 
Producthypothetical protein 
Protein accessionYP_829306 
Protein GI116662252 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACCGG AACCGGTATT GGCGTCAACG CACGCCCTTG TGGGTGTCGC TCCACATCCG 
AGACCAGAAG ATGCTTGGAG TCTTGCTCCG CTGATTGCCG GCACCCCTTC GTACCGTTTG
GGGCGTTGGA TTGGGCAGAG GTTTCAATAT CCTCGGCCGG AGCAGGCGCC GAGGATCACT
CGCTCTTTGC CGGAACATCC TGCCGCTGTG ATGGTGCACG GCGCGGATGG CCGCGTCTCC
ACGCTGTGCC TGGACTTTGA CACCTCCAAG GCACTGAAGG GCGTGGTGGA CTCGGACGCG
GTCCGGGTGG GTCGGTTGTT GGCGGAGTGC GGCATGAGGT TTGTGGAGGA TTTCTCCCCC
AGCGGTGGAC GACACCTGTA CGTCCCGCTG CAAGACCGGA TGGACGCGGC CGAGGCCCGG
GAACTGATCG AAGCGCTAAC CCTGATGGCC AGCAGTCTTG ACCCAAGCCC GCACCAGAAC
ATCACCGACG GGTGTATCCG TGTCCCAGGA AGCATCCACA AGTCCGGTGG CCATCAAACC
CTCATCACTC CCCTAGCCCA GGCGTACGGC ATCCTGCGCC GCCGCAACCC AGCGCAAGGA
TTGGTGAAGC TGCGCGCGGC GTTGGCGCCG GAACTGGCCC GCAAGCGCGT GCTCAAGGCC
CGGGTAGCGA AGACCGTGGC CGTTCAACGG ACCGGCGGGC TACCGGCGCT GCTACTGGAG
TCCCGTAGCG AGACCCCGCT ACGCCGGATC GCCCGGACCG GCTTGTACGA CACGGCTAGG
TACAAGAGCC CGTCCGAGGC CCGGATGGCG GTGCTGAACC ACTTCAGCGC CTGTGGCTGG
TCCCTGCAGC ACGTGAAGAA CGAACTCGCT GGCCAGTTCC CAGGCCTGGC TGCCCTCTAC
GGATCTGCAG AACGCCAGGC AAGGCTCCTT CCCTACGAGT GGGCCAAAGC CCAGGCCTTC
ACCAGCACAG CTTCCAGCAA GCGAAAACCA CTCCCCCCTC AACGGAGGGA AAAGAGTGCA
CTTATTAACA ACACGAGCCC GACTTTACTC ACAGGGGGGG CGGATAAGAC TAGCAGCGTC
GCTGTGCACC AACTTGTGAA CGATTTGGAA AATGTTCTCT ACGCAGTTCT CGACCACAGC
CTCCAAAAAC GCGGCCGTGA AGGCCTCAGC CTGCGCTTAC TGATCCGCGG ACTGCTGGGA
TACATGCGGG CCAAGGAAAC TGACATTCTC GACGTCGGTT GCCGCACTCT TGCTGTAGCG
ATGGGCAAAC ACCATGTCAC CATCGCCAGG CTGCTGCCTA TTCTGGTTCA GGCTTCTGAC
GGGATCCTCA CCAAGATCGC GGACGCACGT CAGAAGGCGG CCGACGTCTA TCTCCTCCAA
CTCCCCGAGC GTTATCAACA GCTTGCACGG GAACTGACCT GGAGGAAAGG AAAAATTCAC
GCCATCCGAC CCGTGTTTCG GGCTTTGGGA GACGCCGCTG CCCTGGCCTA TGAGGCAATT
GAACGCGGCC GCTACTCCCT CACGACCGCA GAAGTCGTCC GTAACAGTGG CGTCAGCCGC
AACGCCTGCT CGGCCGCCTT GGCTGAGATG GAAACGCTCG GCATGATTCA GCGTCACGGC
GGAACCTGGA GAACCACCGC GGTGAACCTT CGCACGCTTG CCGCGAGGCT CGGAGTGCTC
GACGACTACC TTGACCACAT CCGCCGGAAC CGCCATGAGC GGGCGATCTG GCATGCCTAC
CTCGAAAGGT TTAAGAGGAC ACTGCAGTTT GGGATCGTTG AAGCAGACAT GTTTGATCCT
GAAAGGGATG AATATTGGCC ACCACCGGAT GATGCGGCGG TGTGGCGGTT ACGCCAGTCG
GCATAA
 
Protein sequence
MVPEPVLAST HALVGVAPHP RPEDAWSLAP LIAGTPSYRL GRWIGQRFQY PRPEQAPRIT 
RSLPEHPAAV MVHGADGRVS TLCLDFDTSK ALKGVVDSDA VRVGRLLAEC GMRFVEDFSP
SGGRHLYVPL QDRMDAAEAR ELIEALTLMA SSLDPSPHQN ITDGCIRVPG SIHKSGGHQT
LITPLAQAYG ILRRRNPAQG LVKLRAALAP ELARKRVLKA RVAKTVAVQR TGGLPALLLE
SRSETPLRRI ARTGLYDTAR YKSPSEARMA VLNHFSACGW SLQHVKNELA GQFPGLAALY
GSAERQARLL PYEWAKAQAF TSTASSKRKP LPPQRREKSA LINNTSPTLL TGGADKTSSV
AVHQLVNDLE NVLYAVLDHS LQKRGREGLS LRLLIRGLLG YMRAKETDIL DVGCRTLAVA
MGKHHVTIAR LLPILVQASD GILTKIADAR QKAADVYLLQ LPERYQQLAR ELTWRKGKIH
AIRPVFRALG DAAALAYEAI ERGRYSLTTA EVVRNSGVSR NACSAALAEM ETLGMIQRHG
GTWRTTAVNL RTLAARLGVL DDYLDHIRRN RHERAIWHAY LERFKRTLQF GIVEADMFDP
ERDEYWPPPD DAAVWRLRQS A