Gene Arth_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0947 
Symbol 
ID4446540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1017595 
End bp1018893 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content67% 
IMG OID639688753 
Producthypothetical protein 
Protein accessionYP_830444 
Protein GI116669511 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAGCG GCACGTTGGT CAACTTCGCC TTGGTTCTCT TCTTCGTACT GCTGGGCGGC 
GTCTTTGCAG CCACCGAAAT GGCGCTCATT TCCCTCCGGG AAAGCCAGGT GCGCATGATC
GAGAAGGCCG GCAAACGCGG CGCCCGGGCC GCTGCGCTGG CCCGCAACCC CAACCGGTTC
CTCTCCACCG TGCAGATCGG CGTGACGCTC TCCGGCTTCT TCTCAGCCGC CTACGGCGCG
TCCACCATTT CACCCGACAT CGAACCCATC CTGAAAGGCG CGGGGTTCGG CGCCGCGGCC
GAGCCGGTGG CCTTTATCGG CATAACCCTG CTGGTGGCCT ACCTGTCCCT GGTGCTGGGC
GAGCTGGTGC CCAAAAGGCT GGCTATGCAG AGCGCCGTCG GCTTCACCAA GGTCCTGGCC
CCGCCGCTGG TGGTCCTTTC CGAGGTCATG CGGCCCGTCA TCTGGCTGCT GTCCGTTTCC
ACCGACGCCG TGGTCCGGCT CTTTGGCGGT GACCCGCACG CCAAGCGGGA GGGGATCAGC
TCCGAGGAAC TCTGGGACAT GGTGGCGGAG AGCGACCTGC TGGAAGAGAG CAGCCGGCAC
ATCCTGACCG ACGTGTTCGG CGCCGGGGAC CGCACACTGC AGGAGGTCAT GCGCCCCCGC
ACCGAAGTGA CCTTCATTGA CGGCACCATG ACTATTGCCG ACGCACGCAG CATGGTCCGG
GACGGCCCGT ATTCGCGGTT CCCTGTGATC GGCAGGACCC CGGACGACGT CCTGGGCTTC
GTCCACATCC GGGACCTGAT GACCCGGACT GAACAGCAGG ACCAGGGGCT GGTGAAGGAC
ATCGTCCGCG AACTCCTCCC CCTGCCGGGA ACCAACCGGG TGCTGCCGAC GCTGTCGCGG
ATGCGCCGGC TGGGCCACCA CATCGCGCTG GTGGTGGACG AATACGGCGG CACCGACGGC
ATCGTCACGC TGGAGGACCT GGTCGAGGAG TTGGTGGGCG AAATCTACGA CGAATACGAC
ACCGGGGCCG ACCACGAGGA CCGCGTCACC GTGGCCAACG GATCCATCGA CGTGGACGGC
GGCCTGATCC TGCAGGAATT CGCCGCTGCC ACCGGCATCA CCCTGCCGGA GGGCCGCTAC
GAGACAGTGG CCGGGTTCGT CATCTCCCGC CTGGGCCGCC TGCCCGTGGT CGGGGACCGG
GTGCAGGTGC CGGGCCAAGT GCTGACGGTG CTCGCCATGG ACAGGCTCCG CATCGCCCGG
ATCCGGGTGA CGCCCGTGAC CGGGCAGCCG GCGGTCTAG
 
Protein sequence
MDSGTLVNFA LVLFFVLLGG VFAATEMALI SLRESQVRMI EKAGKRGARA AALARNPNRF 
LSTVQIGVTL SGFFSAAYGA STISPDIEPI LKGAGFGAAA EPVAFIGITL LVAYLSLVLG
ELVPKRLAMQ SAVGFTKVLA PPLVVLSEVM RPVIWLLSVS TDAVVRLFGG DPHAKREGIS
SEELWDMVAE SDLLEESSRH ILTDVFGAGD RTLQEVMRPR TEVTFIDGTM TIADARSMVR
DGPYSRFPVI GRTPDDVLGF VHIRDLMTRT EQQDQGLVKD IVRELLPLPG TNRVLPTLSR
MRRLGHHIAL VVDEYGGTDG IVTLEDLVEE LVGEIYDEYD TGADHEDRVT VANGSIDVDG
GLILQEFAAA TGITLPEGRY ETVAGFVISR LGRLPVVGDR VQVPGQVLTV LAMDRLRIAR
IRVTPVTGQP AV