Gene Arth_3165 details


Gene Information

Locus tag: Arth_3165
Symbol:
ID: 4444225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3554170
End bp: 3555876
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 62%
IMG OID: 639690991
Product: Allergen V5/Tpx-1 family protein
Protein accession: YP_832643
Protein GI: 116671710
COG category: [S] Function unknown
COG ID: [COG2340] Uncharacterized protein with SCP/PR1 domains
TIGRFAM ID:
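
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3554170, 3555876)
print(length_bp, protein_length(length_bp))  # 1707 568
```

Both values agree with the Gene Length and Protein Length fields of the record.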


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.845212
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGAAC AATTGCATAT CTTTCGCGGC GCTTTGTACC GCTCTCGTGA GCTCGTTTCA 
AGAGGAGCCA AATCAGCGCT TGGCGCGCTG ATCTGCCTGT CATTGTTGTC AGCATCGTTG
CTGGCGGGAA TGCCGGCAGC GCAAGCTGCG GAGCTGGTGG GTGGCGAGAC AATCGACCGG
ACCAATCAGG CCAAAATCGT CGAAGTTTTC AATGGCATCA ACGACTTCCG GGCCTCCCAA
GGGCTCAATC CGGTGAATTT CAATGCCACT GTCTCGGAGA TGGCCGAAGA CTGGTCCGAC
CATATGGCGG CTTCCGGTAA TTTTGTCCAC AACCCGAATT TCTACACCGA TGCCAGGGTC
ACGGGCCGCT TCGCCGGCGC GGCCGAAATC ATCGCAGCCC GTTCGGATGA CTGGGCGCAG
GGACTGGTGG AGCAGTGGAT CGACTCACCC GGACACAACG CCGTCATGAG CGATCCCAAG
CTCACCACCG TGGGCGTAGG GATCACCTAT TTGGAGGGCA AACGCAGCGG CGAGCTGACG
TTGTATGGCA CGGTGAATTT CTTCACGTTC TGGAATCCGC CGGTTGGGAT GTACACCACC
GCCCAGGACT TCTTCGACGG CAAGCCCTCC ATCGACACGG CACAGATAAT CACCGTCGAT
ACTGAAGATC CCGTCTTTGA CGACGGGCTC AATAAAGTGA CCATCCCTGA TGCCCAGGGA
GTGGACTACT TTGTGAACCA GGTTCCGACG CCGCCCGGAA CCTACGACGC CGTGGTGGGC
CGCATGCAGG TCACTGCGAC CGCCAAGGCG GGCTACCGCG TCTTCGGGAC ACTGCGTTGG
GAGCATGAGT TCCTGGCACC GCCCACGGAA ATCGTCCCGA TCGCTCCGAC GTTTGATGGC
GTAACCGGCC GGTTCACTGT TCCTTACATG GAAGGGGTCC AGTACTGGGT CAACGATTCC
CTCTGGGGCT CCTCAACCTT CAGCAGCGAG TGGGACACAA CGGTCTATAT CCGGGCCGAG
GCCCTCAAGG GCTACAAGCT CAGCGGCACC ACGTCCTGGC AGTACTACTT CGCCAAGCCG
GACCAGCCGG CTGATCCAGG CCCGGAGACG CCCGCAGAGC CGTCCGGCAG CTTTACCGAT
GTCCCTGCAG GAACGCAGTT CGCCACCGAG ATCAACTGGC TCGCCAGCCG TGGCATCAGC
ACCGGCTGGG TAGAAGGCGA CGGCTCTGCA ACGTACCGGC CGCTGACTCC GGTCAACCGG
GACGCCATGG CCGCCTTCAT GTACCGCCTG CTGGGCGAGC CGCCCTTCAA CGCCCCGGCC
GCGTCTCCGT TCGCGGACAT GTCCAGCAGC ACCAAGTTCT ACAAGGAGAT CACCTGGCTC
GCGGACAAGC GCATTTCCAC TGGCTGGGAA GTCAACGGGG CGCGGACGTA CCGCCCTGTG
ACGCCCGTCA ACCGGGACGC CATGGCCGCC TTCCTTTACC GGCTCGCCGG CCAGCCCAGC
TTCACGCCCC CGCTCGCCTC GCCGTTCATG GATGTGGATA CGAGCAACCA GTTCTACAAG
GAAATCACGT GGCTGGCCGC GCAGGGCATC TCCTCGGGCT GGAACGAAGG CAACGGCAAA
GCCAGCTATC GGCCCTGGTC TGCCGTCAAC CGCGACGCCA TGGCCGCCTT CATGTACCGC
TGGAACACAA AATTCGGCCG GCCATGA
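
The sequence above is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. The following Python sketch shows one way to do that and to compute a GC fraction. It assumes that GC content is the share of G and C bases over the sequence length, which may differ from how the source database derives its reported percentage. The helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGGGAAC AATTGCATAT CTTTCGCGGC GCTTTGTACC GCTCTCGTGA GCTCGTTTCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full 1707 bp sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.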
 
Protein sequence
MGEQLHIFRG ALYRSRELVS RGAKSALGAL ICLSLLSASL LAGMPAAQAA ELVGGETIDR 
TNQAKIVEVF NGINDFRASQ GLNPVNFNAT VSEMAEDWSD HMAASGNFVH NPNFYTDARV
TGRFAGAAEI IAARSDDWAQ GLVEQWIDSP GHNAVMSDPK LTTVGVGITY LEGKRSGELT
LYGTVNFFTF WNPPVGMYTT AQDFFDGKPS IDTAQIITVD TEDPVFDDGL NKVTIPDAQG
VDYFVNQVPT PPGTYDAVVG RMQVTATAKA GYRVFGTLRW EHEFLAPPTE IVPIAPTFDG
VTGRFTVPYM EGVQYWVNDS LWGSSTFSSE WDTTVYIRAE ALKGYKLSGT TSWQYYFAKP
DQPADPGPET PAEPSGSFTD VPAGTQFATE INWLASRGIS TGWVEGDGSA TYRPLTPVNR
DAMAAFMYRL LGEPPFNAPA ASPFADMSSS TKFYKEITWL ADKRISTGWE VNGARTYRPV
TPVNRDAMAA FLYRLAGQPS FTPPLASPFM DVDTSNQFYK EITWLAAQGI SSGWNEGNGK
ASYRPWSAVN RDAMAAFMYR WNTKFGRP
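
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The Python sketch below is a minimal illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last line of the gene sequence from the record.
print(translate("ATGGGGGAAC AATTGCATAT CTTTCGCGGC"))  # MGEQLHIFRG
print(translate("TGGAACACAA AATTCGGCCG GCCATGA"))     # WNTKFGRP
```

The two outputs match the first and last residues of the protein sequence listed above.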