Gene Arth_2245 details


Gene Information

Locus tag: Arth_2245
Symbol:
ID: 4445167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2525395
End bp: 2526375
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 70%
IMG OID: 639690054
Product: helix-hairpin-helix repeat-containing competence protein ComEA
Protein accession: YP_831725
Protein GI: 116670792
COG category: [L] Replication, recombination and repair
COG ID: [COG1555] DNA uptake protein and related DNA-binding proteins
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region; [TIGR01259] comEA protein
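
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2525395
end_bp = 2526375

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 981 bp
print(protein_length, "aa")  # 326 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.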


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACGCC GGAACGCGGA AGCGGGAGCA CCTGCCGCGG CCAGCCGCGC GCGCCGGCGC 
CTGGCAACCA CACTGGGGCC TTCCGGCGGC GGTGACGGCG GTGGATCCGA TGCCGGGAAC
CCGGGTTCGG CGGGGCTCCT GGGGTACGGC GCCGGGGTTC ACGGCCACGG GTCTTTCACG
TATGACGGCG GTGCGCAGCC GGAACCTGGC GCCGTTGGCG GCTCGGCCGC CGATTCATCC
GCCGTTTCCG GAACACATGC ACCGGCGAAG TTCCGCTGGC GCTCGGGGTT TCGCGTTGCG
GTGCTGCTGG GACTGCTGAG TCTCCTGCTG GGCGGCTGGT TCTGGTGGGA TGTGGCAGCC
AGCCGTCCCC ACGTGGTGCC CTTGAGCGAC GTCAGCAGCC CGGAAGTCAG CGCGCAACAG
GAAGGACATC CCGGTTCAGG GCCGGGAGGA TCCGACGGCG CCTCAACGGG ACGACAGCCT
TCAGGTGCCG CATCCGGAGC GAAGATCATT GTCCACGTGG CGGGGGCCGT CAACCGGGCG
GGCGTTGTGG AACTGCCGGA AGGCAGCCGG GTCCACGAGG CCATCGCCGG TGCGGGTGGA
AGTGCGGAAG GGGCTGACCT GAACCGGTTG AACCTTGCCG CCGTCCTGGC CGACGGCCAG
AAAATCCACG TTCCGCTGGT GGGGGAACCG GTGGACGCCC CTGGGGCGGC AGCCGGCGCC
ACCGGACCGG GAGCCGCGGG ATCGGGGCCC GGCGACTCAG TCCCCGGCCA AACGGGAGCG
GAGGGCGGGA AGATCGACCT GAATTCGGCG TCTGCCGAGG AACTGGGCGC ATTGCCCCGG
GTTGGTCCCG TGCTGGCCCA GCGCATCGTC GATTGGCGCA AGGAACACGG CCGGTTCAGC
ACCGTTGAGG AACTAGACGC GGTGGACGGT GTTGGCCCCA AGATGCTGGA AACGCTGCTG
CCCCTCGTTC GGGTGTCCTG A
 
Protein sequence
MSRRNAEAGA PAAASRARRR LATTLGPSGG GDGGGSDAGN PGSAGLLGYG AGVHGHGSFT 
YDGGAQPEPG AVGGSAADSS AVSGTHAPAK FRWRSGFRVA VLLGLLSLLL GGWFWWDVAA
SRPHVVPLSD VSSPEVSAQQ EGHPGSGPGG SDGASTGRQP SGAASGAKII VHVAGAVNRA
GVVELPEGSR VHEAIAGAGG SAEGADLNRL NLAAVLADGQ KIHVPLVGEP VDAPGAAAGA
TGPGAAGSGP GDSVPGQTGA EGGKIDLNSA SAEELGALPR VGPVLAQRIV DWRKEHGRFS
TVEELDAVDG VGPKMLETLL PLVRVS
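
The relationship between the gene sequence and the protein sequence can be spot-checked by translating fragments of the gene. The following is a minimal sketch, not the database's own method: it builds the codon table used by translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code, and translates the first 60 and last 21 bases of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon table in the conventional T, C, A, G ordering. Translation table 11
# assigns the same amino acids to codons as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 and last 21 bases of the gene sequence, copied from the record.
# The last 21 bases are in frame because they follow 960 bases (320 codons).
first_60 = "ATGTCACGCC GGAACGCGGA AGCGGGAGCA CCTGCCGCGG CCAGCCGCGC GCGCCGGCGC"
last_21 = "CCCCTCGTTC GGGTGTCCTG A"

print(translate(first_60))  # MSRRNAEAGAPAAASRARRR
print(translate(last_21))   # PLVRVS
```

Both fragments match the corresponding ends of the protein sequence, and the final codon TGA is a stop codon, which is consistent with a 981 bp gene encoding a 326 aa protein.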