Gene Arth_3878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3878 
Symbol 
ID4446835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4361996 
End bp4363762 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content68% 
IMG OID639691702 
ProductAllergen V5/Tpx-1 family protein 
Protein accessionYP_833353 
Protein GI116672420 
COG category[S] Function unknown 
COG ID[COG2340] Uncharacterized protein with SCP/PR1 domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGGAAGA TTTCGGCACG GACACTGGCA GCCATGATCT GCGCACTGGC ACTCATCACG 
CCTGCGGCCG GCCAGCTGGG GGGACCTTCC TCAAGCCCAA AACCCGATGT GTCCGCCGTG
GCGGACCAGC AGGCGCAAAG CGTCGCCCTC GGCCCGCCAT CGTTCGTGTA CGACGCCGGC
TTGGGCACCG CGGATGACAC ACCAGGGCAG CGCCTCGCCC CCGCCGTGGA CCCGGCGGTC
AGGGAAGGGG ACCCAGGCGG CACAACTGCT GCTTTGGTCA ATACCTCAGG AACGGCCGGC
GCCGGTGGCG GCGCCGGCAC ACTCTCCACC AACACGCCGC CGCTGCCAGC AGGAGACTCC
CCTCCCGAGC CGGCGTCCGT CCCACCGTTG TCCGCCACCG ACGCCGACCT GGCGGCCCTC
ACCCAGGCCG GGCTGCAGAC CCGGCCTGCA CCCACGGCGG GCACCGAAGC CCTCCGAACC
GAATCACTGA CGACGCAGGC CCTCGTCAGG GACGACGCGT CGTCCCAAAT CCTGGCGGTG
TTCAAAGCCA TCAACAGCTA CCGGGCTTCG TTCGGACTGC CCGCCGTGAA GTACCACGCC
ACAGTGGCCG CCATGGCCCA GGAGTGGTCC GACAGCATCG CGGCCCGGGA AGTGATCGAG
CACCGCTCGA GCTTCTGGAC CGATTCGCGG GCGCTCAGCC CCACCAATGG CGCCGGCGAG
GTCATTGCCG TCCGCTGGGA CCGCGACGCC GCCCAGCTTG TTGAGTGGTG GAAGGGCTCG
CCCGCCCACA ATGCAATCCT GAAGGACCCG CGGTTCAATG TGATGGGGAT CGGGATCACC
TTCACGGACG GCAACTGGCA GACCACGCCC AACCGCTACA CCATGTGGGG CGTGGTGGAC
TTCTTCGGAT ACGGCACGCT GCCCGCGGGA ACCACCAGCA GCCCGGGCGG CAGCACTGAA
ATGCCCGTAC AGCCCGCCAG CGTGTGCGAT CCGCTGGTGC GGCACATGCC GCCGTCGGCG
GACCTTGCGG CGGCCGCGAT CAAGGGTCCC GGCGACCTCG TGTCGGTGAA CTCATCCGGG
GAACTCATCA ACCGCCCGTC CCTGGGAAAC CGGCAATACG GCGCACAACA GGTCGTCGGG
ACCGGATTCG GCTCCGCCAA GGAACTCTTT GTCACCGACT GGGACCGGGA CGGAGTCTTT
GACATCCTGG TCCAGTGGAC TGATGGCAGG GTTACGCTGC ACGCCGGCTC GGTGGGCGGC
GGATTCCTTC CAGGCGTGAC ACTGGGCCAG TCCGGGTGGG CGGGAATGAC CCTGGCGGTC
GGGGGCTGGT GTGCCAACAA CCGCCTCCCG CAACTGGTGG CGCTGGACAC CTCCGGGAAC
CTCTGGCTGT ACCCCAACCG GGGCAAAGCG GACCTTGTGC AGCGGACTCT GATGGCGTCC
GGCGTTTCAG CCAACCGGCT GGCCATGGCG GATTACGACG GCGACGGCTT CCAGGACCTG
TTGGCCCGGC AGTCGGACGG TTATGTCCGG CTCTTCCGCG GCTCGGGCGC GCCGGCACCG
CGCGCCGAAA CCCGGGCTGT GGTGGCCAGC GGATGGTCGG ACGTTACAGC CATCCGTCCG
CTGCGGGATG TCACGGGCCT GAACTCAACG GGACTGGCCC TCCGACGAGC CGGCGACGTG
GTGCAGTATT GGGACCTCAG CACGGGCGCC TTGACGTCGC CGTCGTCCAT CCCCGGAACG
TGGGCGGGAC AGCGCCTCGC GCAATAG
 
Protein sequence
MRKISARTLA AMICALALIT PAAGQLGGPS SSPKPDVSAV ADQQAQSVAL GPPSFVYDAG 
LGTADDTPGQ RLAPAVDPAV REGDPGGTTA ALVNTSGTAG AGGGAGTLST NTPPLPAGDS
PPEPASVPPL SATDADLAAL TQAGLQTRPA PTAGTEALRT ESLTTQALVR DDASSQILAV
FKAINSYRAS FGLPAVKYHA TVAAMAQEWS DSIAAREVIE HRSSFWTDSR ALSPTNGAGE
VIAVRWDRDA AQLVEWWKGS PAHNAILKDP RFNVMGIGIT FTDGNWQTTP NRYTMWGVVD
FFGYGTLPAG TTSSPGGSTE MPVQPASVCD PLVRHMPPSA DLAAAAIKGP GDLVSVNSSG
ELINRPSLGN RQYGAQQVVG TGFGSAKELF VTDWDRDGVF DILVQWTDGR VTLHAGSVGG
GFLPGVTLGQ SGWAGMTLAV GGWCANNRLP QLVALDTSGN LWLYPNRGKA DLVQRTLMAS
GVSANRLAMA DYDGDGFQDL LARQSDGYVR LFRGSGAPAP RAETRAVVAS GWSDVTAIRP
LRDVTGLNST GLALRRAGDV VQYWDLSTGA LTSPSSIPGT WAGQRLAQ