Gene Arth_2268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2268 
Symbol 
ID4445311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2554647 
End bp2555747 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content66% 
IMG OID639690077 
Producthypothetical protein 
Protein accessionYP_831748 
Protein GI116670815 
COG category[S] Function unknown 
COG ID[COG2339] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.37562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCGC CTGCCGCGGT GCGCAGCCGG TCAAGTGCGG GAGTTGTAGG ACTCGTGGGC 
GGCGGCGGCT TCCTGGCTTT TGCCAGCCTC TTCCTGGTCC TCCCCTACCT TGTTGGCAAC
ACCGGCGTCA CCGGGTTCGT GATCGGCTTC ATCGCGTCCC TGATCCCGTT GAGCGCCGTG
CTCCTCGCTG TCTACGTCAT TGACCGCTGG GAGCCCGAGC CCAAGCGCCT GTTGCTGTTC
GCCTTTATGT GGGGCGCGGT CGTGTCTATT TCCGTCACGC TGCTGATCCA GCCGGTTTTT
GCGCTGGCAG CGGTGCCCCC GGCCGGCGTG GACTACCGGA CTTTTGCCGT CACCGTGCAG
GCCCCCGTGG TGGAGGAGTT CGCCAAGTCC TTGGGCCTGT TGCTACTGCT TGTGCTGGCG
CGGAAGCACT TTGACGGCCC GGTGGACGGG GTGGTGTTCG CCTTCACCAT TGCCGGCGGC
TTTGCTTTTA CGGAAAACAT CCTCTACTTC GGCCGGGCCA TCGCCGAGTC CGCAACACCG
GGCACAGACC TCGCGGTCGT GTTCTTCCTG CGGGGCGTCA TGTCCCCGTT CGCCCACGCC
ATCTTCACGG GTACCACCGG ACTCATTCTC GGTTTTGCGG CGCGGCGCTG GCACCCGGGG
ATGTCCGTCG TTGCGTTCGC CGTGGGACTG GTCCCGGCGA TGATCCTGCA CAGTATGTGG
AACAGCATGG GCCAGGACTT CCTGGTTCAG TACATCGTGG TCCAGGTGCC CATCTTCGTG
CTGGCCGTCG TCGTTATTGT GCTGCTGCGT GTGGCGGAGA ACCGGCTAAC GCGGCAGCGG
CTCCAGGAGT ATGCAGCCGC GGGGTGGTTC ACGCCGCCGG AAGTGGAGAT GCTGGCAACC
GCCGGCGGAC GCCGTTCTGC GGTCCGCTGG GCGAAGCAGT TCGGCCGGGG GCCGCAGATG
AAAGCCTTCC TGCGATCGGC CACCCGGCTT GCCTTCATCA GGCAACGGAT CCTCAGTGGC
CGGGACGTTC CGGCCCACCA GCTGGACGAG CACCACCAGC TCGCAGAAGT CGTGGCCCGG
CGGGACGCCG TGCTGCGCTA G
 
Protein sequence
MMPPAAVRSR SSAGVVGLVG GGGFLAFASL FLVLPYLVGN TGVTGFVIGF IASLIPLSAV 
LLAVYVIDRW EPEPKRLLLF AFMWGAVVSI SVTLLIQPVF ALAAVPPAGV DYRTFAVTVQ
APVVEEFAKS LGLLLLLVLA RKHFDGPVDG VVFAFTIAGG FAFTENILYF GRAIAESATP
GTDLAVVFFL RGVMSPFAHA IFTGTTGLIL GFAARRWHPG MSVVAFAVGL VPAMILHSMW
NSMGQDFLVQ YIVVQVPIFV LAVVVIVLLR VAENRLTRQR LQEYAAAGWF TPPEVEMLAT
AGGRRSAVRW AKQFGRGPQM KAFLRSATRL AFIRQRILSG RDVPAHQLDE HHQLAEVVAR
RDAVLR