Gene Arth_3166 details


Gene Information

Locus tag: Arth_3166
Symbol:
ID: 4444226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3555873
End bp: 3557258
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 65%
IMG OID: 639690992
Product: Allergen V5/Tpx-1 family protein
Protein accession: YP_832644
Protein GI: 116671711
COG category: [S] Function unknown
COG ID: [COG2340] Uncharacterized protein with SCP/PR1 domains
TIGRFAM ID:
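
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both are assumptions that happen to agree with the reported values, not documented properties of the record. The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 3555873, 3557258
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1386
print(protein_length_aa(length))  # 461
```

Both results match the reported Gene Length (1386 bp) and Protein Length (461 aa).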


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGCCGC TCGCGTACCG CTCATCAACC TTCAGGGGGA ACCAATTGCA CCTACTCGTT 
ACAAGCCTGC GCTCGTTGCG GAGTCCACGG ACCGCACGGA AGGCGGCAGC CGCTCTCACC
TGCACCATGC TGCTTTCCGC AACGTTCTTT GTTGGCGGTC CGGCCGCCAC CGCGGCTCCC
GTGGCGCCCG CAACCGAAGC GTCGGATACC CGCCAGGGTG AGCAAGTGGG AACTCCAGCG
GTCGTACCGG CGCCCGCCGC CGCCCAGGCT CCGGCAGCCG GACCACTGCC CGGCGCGCCC
ACCACCGGAA GCGGACCCGT CACCGAAGGC ACCACGACCG CCGGCGGAAC CATCAACAGG
GCCAACAGCC AGCAGCTCAC CGATGTCTTC AACGCCATCA ATGCCTTCCG GGCCACGAAG
GGGCTGCAGG CCGTGACATT CAATGCCACG GTGTCCGAGA TGGCCGAGGA CTGGTCCGAC
TACATGGCCA GCAGCGGCAA ATTCGAGCAC AATCCCGGCT TCCACACCGA TCCGCGTGTG
GAAGGCCGCT GGACCTATGC AGGCGAAATC ATCGCCGGCC GCCCGGACCA CTCCGGAGCC
GGTCTGGTGG ACCAGTGGAT CGCCTCGCCC GGCCACAACG CAATCATGAG TGACCCCAAC
TACACCACCA TCGGCATCGG CATCGCCCGG CTTTCCGATC CCGACTACCT CCTGGGCACC
GTGAACTTCT TCAATTTCGC GATGCCCCCG GCCGGCTCGT ACGCCACAGC TGCGGAGTTC
CTGAACGGTT CAGCAGAATC GACCCCGTTC GCGGATGTCC CTTCCGGCAC CCAGTTCGCG
GATGAGATCA ACTGGCTCGC CAGCCAGGGC ATCAGCACGG GCTGGAAGGA AGCGAACGGA
ACCACCACCT ACCGTCCCGT GACGCCGGTC AACCGCGATG CCATGGCAGC GTTTATGTAC
CGCTTGGTGG GCGAACCCCC GTACAGCGCG CCGTACTACT CCTGGTTCGC AGACATCGCT
CCCGGGACGC AGTTCTACAA GGAGATCAAC TGGCTGGCGG AATACGGCAT CTCCGGCGGC
TGGGATGAAG GCAACGGCCA GTACACGTAC CGTCCGCTGA CGCCGGTGAA GCGCGACGCC
ATGGCCGCGT TCCTGTACCG GCTCATGGGC CAGCCCGCGT TCACCCCCCC GGCCACTTCG
CCCTTCGTAG ATGTGTCCAC CAGCAACCAG TTCTACAAGG AGATCACCTG GCTGGCCTCC
CTGAAGATCT CCACCGGCTG GGATGAAGGA AACGGAAAGT TCTCCTACCG GCCGCTGAAC
TCGGTCAACC GCGATGCCAT GGCCGCCTTC ATGTACCGCC TGGTCACCAG CCCGGCGAAC
GGCTAG
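
The GC content reported for this gene can be recomputed from the sequence above. The helper below is a minimal sketch: `gc_content` is a hypothetical name, and it assumes the sequence is pasted as plain text with the 10-base blocks separated by spaces or newlines. How the source database rounds the percentage is not stated, so no rounding is applied here.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Example on the first two 10-base blocks of the gene sequence.
print(gc_content("ATGAGGCCGC TCGCGTACCG"))
```

Passing the full gene sequence instead of the two-block example gives the value to compare against the GC content field.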
 
Protein sequence
MRPLAYRSST FRGNQLHLLV TSLRSLRSPR TARKAAAALT CTMLLSATFF VGGPAATAAP 
VAPATEASDT RQGEQVGTPA VVPAPAAAQA PAAGPLPGAP TTGSGPVTEG TTTAGGTINR
ANSQQLTDVF NAINAFRATK GLQAVTFNAT VSEMAEDWSD YMASSGKFEH NPGFHTDPRV
EGRWTYAGEI IAGRPDHSGA GLVDQWIASP GHNAIMSDPN YTTIGIGIAR LSDPDYLLGT
VNFFNFAMPP AGSYATAAEF LNGSAESTPF ADVPSGTQFA DEINWLASQG ISTGWKEANG
TTTYRPVTPV NRDAMAAFMY RLVGEPPYSA PYYSWFADIA PGTQFYKEIN WLAEYGISGG
WDEGNGQYTY RPLTPVKRDA MAAFLYRLMG QPAFTPPATS PFVDVSTSNQ FYKEITWLAS
LKISTGWDEG NGKFSYRPLN SVNRDAMAAF MYRLVTSPAN G