Gene Arth_4277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4277 
Symbol 
ID4443528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp10081 
End bp11700 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content67% 
IMG OID639687598 
Producthypothetical protein 
Protein accessionYP_829295 
Protein GI116662241 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGGCA GCAAGACCAA GCACCAGCGC ACACAGGCCC CGGCCCCGGC CACGAAGCTC 
GCCGCCGCGG TCGCCCGCCG CCCGGGTCTG CGGGGCTGGC GCGGCCGCGG ACAGGGCGAA
GCCGTCTACG TCACGGCAGC CGACGAATGG CGCGGCACCT CGGTCCAGGT CTGCGGTCTC
TGGCCGTTCG TGGGCGGGTC CGGCACCCCG ATCCTGGGAG TGCCGATCGG TGTTCACACC
GACACCGGAG CGCCCTTCGG GGCCGATCCG ATCAGCTGGT TCATGCACGG CATGATCAAC
AACCCATCCA TGTTCGTCCT GGGCCTGCCG CACTGCGGCA AGTCCACCCT GGTCCGGCAC
ATCGTCCTGG GCCTGGCAGG CCGCAGCATC AACCCGTTGA TCCTCGGTGA CCTGAAGCCG
GACTACGTTG ACCTGATCGA AGCACTCGGC GGGCAGGTCA TCAGCCTGGG TCCTGGACGC
GGCCACTTGA ACGTCCTGGA CCCCGGTGAA GCCACGGGAG CCGCCGTCAG GCTGCTCGCC
TCGGCCGAGG AACACCGGGC CAAAGCGAAC CGGACCCTGA ACGATCCAGA TGGCGACCAG
GCAGCGGCCG CGCAGCTGCT CGCTGCCGCG GTGAAGGCGG AGAAGCTGGC CGAACAGCTG
ATGGCGGACT CGCACAACCG GCGCCTGAAC ATGATCACCG CGCTGATCAC CATCGTGCGG
AACAGCAAGC CCAGCGCCCA TGAGGAATCC CTGATTGACA GGGCCCTGCA CATCCTGGAC
GAGCGCCTGG ACCGCGTTCC CCTGATCGGG GATCTGATCG AGGTCATCAA AGACGCCCCC
GACGAACTGC GCGCGATCGC CCTGGACCGT GGCGATATGA GCCGCTACCT CGACGCGACC
GACCAGCTGC GCCAGTCGCT GTACTCCCTC GACGGCTCGG GCCGGTTCTC GGACATGTTC
TCCCAGCCCA CCGACCAGCC GATGGACCTG ACCAAACCCG TGGTCTTTGA CCTGTCAGGG
ATCTCCGACA CGCAGCGCGA CATCCAAGCG GCCTGCCTGC TGGCCTGCTG GTCAACAGGG
TTCGCCACCG TCCAGGTCAC CCACACCCTG GCCGACGCCG GCCTGGAACC GCGCCGGAAC
TACTTCGTCG TCATGGACGA ACTGTGGCGG GCCCTGCGCT CCGGTGAGGG CATGGTGGAC
CGGGTTGACT CGCTGACCCG CCTGAACCGG ACCGAGGGCG TCGGGCAGGC CATGATCACG
CACACCATGA GCGACCTCGA AGCTCTGCCA ACCGAATCCG AACGGATGAA AGCGCGCGGA
TTCGTCGAAC GCTCCGGCAT GGTCGTCTGC GGCGCGCTCC CCGGAGCCGA AATGGAAAAA
CTCAACAAAG CCGTCACCCT CTCCCGGGCC GAACAAGCCC GCCTCATCTC GTGGGCTGAC
CCGGGGTCAT GGACCGACGT CGGCGGATCC CGCAAACGCG TACGCCCGGG CCTGGGCAAA
TTCCTCTTCA AAGTCGGCGG CCGCCCCGGG ATCCCGGTCG CCCTCAAACT AACCAGCGTC
GAGGAATCGC TACACGATAC AAACAAGCGC TGGACGATCA ACAGTGAGAA GTTCGCGTGA
 
Protein sequence
MFGSKTKHQR TQAPAPATKL AAAVARRPGL RGWRGRGQGE AVYVTAADEW RGTSVQVCGL 
WPFVGGSGTP ILGVPIGVHT DTGAPFGADP ISWFMHGMIN NPSMFVLGLP HCGKSTLVRH
IVLGLAGRSI NPLILGDLKP DYVDLIEALG GQVISLGPGR GHLNVLDPGE ATGAAVRLLA
SAEEHRAKAN RTLNDPDGDQ AAAAQLLAAA VKAEKLAEQL MADSHNRRLN MITALITIVR
NSKPSAHEES LIDRALHILD ERLDRVPLIG DLIEVIKDAP DELRAIALDR GDMSRYLDAT
DQLRQSLYSL DGSGRFSDMF SQPTDQPMDL TKPVVFDLSG ISDTQRDIQA ACLLACWSTG
FATVQVTHTL ADAGLEPRRN YFVVMDELWR ALRSGEGMVD RVDSLTRLNR TEGVGQAMIT
HTMSDLEALP TESERMKARG FVERSGMVVC GALPGAEMEK LNKAVTLSRA EQARLISWAD
PGSWTDVGGS RKRVRPGLGK FLFKVGGRPG IPVALKLTSV EESLHDTNKR WTINSEKFA