Gene Arth_2244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2244 
Symbol 
ID4445166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2523423 
End bp2525393 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content70% 
IMG OID639690053 
ProductComEC/Rec2-related protein 
Protein accessionYP_831724 
Protein GI116670791 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.425398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGCAAGG TCAGTCCGTG GCACCGCTTC ATCGACGCAG CCGTCAGCGG TGAGGAGGGA 
GGGACCCCCC GGCCGCCGCG CACTGCCGAG AACCCGCCGG AGGCCCGCCA GGCAGCGGAA
GTCCGCGCTG CCGGCATCAG GGGCATCGTG CCGCGAACGG CGTCCGGCCT GCGGGAAGTG
CTCGCGGCGC GACTCCTTCC CGGACCGGAG GGCCGGGTAC AGGGGACAGG AAGACCTGCG
GGAACCGCAA CCGGCAACAG GCCGCAGCGC CGGACCGACC TGCGTCTCGT ACCTCCGGCA
CTCCTCGCCT GGGCCGCCGC CGTCGCCGGC GTCTGGTTGC CGTTGCCCGC GCTCGGGGTC
TTGATCGCGG GATTGCTCCT GGCAGCTGTG GCGCTGTTAG CTGCCATCCG CTGTCGCCGG
ATGCGGCGCC ACGGGGCAGG GCTGGTGCCA CGGAGCTTCC TGACCACCCT CGGCATTGCC
GTGGTGCTGT CCGCGGCCAT CGCGTCCCAC TCGGCCATCG CCGCATCCCA GAAGCACGAC
GGGCCCGTTG CGGATGCAGT CACGGCCCGC TCCGCTGTAG TGGTTGAGGC CGAGATCGCG
GGGACACCGC GCCAGCTGAA AATTCCCGGC CGCGGCGGTT CGGACCGTTG GGCAGTTGAA
GCCACCGCAT ACGCGATTAT CGCCAACGGC GGCCTCATCA GGAGTGACGC CCGCCTGTTG
CTCGTGGGCG GCGGGGACTG GCAGCACGTG GTCCCCGGAC AGCGGATCCG GACTACGGGA
AAGCTGAGGC CGGCCGACCA CGGCCAAACG CAGGCGGGGA CCCTGTCCGC CACCACGGAC
CCAGCCACCA CGGCAGCCAC CACCGCATGG CAGGAGGGAC CAGGCGCCCT CCGCAGGGGG
TTTGCCGCCG CAGCCGAATG GATCGGCGGC GATGCCCGCG GCCTCCTGCC GGGAATGGTC
ACGGGCGACA CCAGCTACCT CGACGACCAG CTTGGAAGCG CCATGAAAAC TGTCGGCATG
ACCCACCTGA CCGCAGTCAG CGGGGCGAAC TGCAGCCTCA TCCTGGGGAC TCTGTTGTTG
GCTGCCCGAA CTATCCGCCT GTCGAGGGCT CCCGCGGCTG CGGCTGCCCT CTCCGGCCTG
GCACTGTTCG TCCTGATGGT GGGTCCGGAC GCCAGCGTCC TCCGGGCTGC GCTCATGGGC
GCAATCGGAC TGGTCTCTCT GTCCGGCGGC CGGACCGGAC GCGGGCTCAG CTTCCTCTGC
CTGGCCGTGA TCGGGCTCCT CATGGCCGAT CCGGGACTCG GGACCAGTTT TAGCTTTCTG
CTGTCCGTGC TGGCCACTCT CGGCATCGTG ACGGCCGGCC CGCGGATCAT GGAATGGCTG
CCACCGGTTG TGCCGCGCTG GCTGGCTGCC GGCCTTGCCG TTCCGCTGTC CGCCCAACTC
TTCTGCAGTC CGGTGATCGT CCTGCTGCAG CCCCAGTTTT CGTCCTACGC CCTGCTGGCA
AACATGGTGG CGGCTCCCCT CGTGGCGCCG GTGACCATCC TCGGAACAGC CGCCGTGCCC
GTGGTGCCGC TGGCCCCTTG GGCAGCTGCG GTTCCCATGG CGGTTGCGGG TGGCTGCGCG
GCCGGAGTCG CGGCCGTTGC CAGGTTCTTC GCGGACCTGC CCGGCGCCGC CCTGCCGTGG
CCTGAGGGAC CGTTTGGCGC GGCCACCATG GTGGCGCTGT CCGGCTGCAC GCTGCTTGTG
CTGTGGCTTG TTCTGCACCC GCGCGCCTTG TGGACGCTGG TCCTGGCGAC GCACCGAAAG
ACCGTCGATC TGCTGGATCT GTGGCCCCGG TTCACCGGCT TGGATGAGCG CCGCCACCGT
GGGAGTCTTA GAGGTATTAA TCCGATGTCC GGGAGGAACC AAGAGTGGCC GCTGCGCAAA
AAGCACGATC CAAGCCGGCG ACGTCAGCGA CCGCCACCTG GCGCGACGTG A
 
Protein sequence
MGKVSPWHRF IDAAVSGEEG GTPRPPRTAE NPPEARQAAE VRAAGIRGIV PRTASGLREV 
LAARLLPGPE GRVQGTGRPA GTATGNRPQR RTDLRLVPPA LLAWAAAVAG VWLPLPALGV
LIAGLLLAAV ALLAAIRCRR MRRHGAGLVP RSFLTTLGIA VVLSAAIASH SAIAASQKHD
GPVADAVTAR SAVVVEAEIA GTPRQLKIPG RGGSDRWAVE ATAYAIIANG GLIRSDARLL
LVGGGDWQHV VPGQRIRTTG KLRPADHGQT QAGTLSATTD PATTAATTAW QEGPGALRRG
FAAAAEWIGG DARGLLPGMV TGDTSYLDDQ LGSAMKTVGM THLTAVSGAN CSLILGTLLL
AARTIRLSRA PAAAAALSGL ALFVLMVGPD ASVLRAALMG AIGLVSLSGG RTGRGLSFLC
LAVIGLLMAD PGLGTSFSFL LSVLATLGIV TAGPRIMEWL PPVVPRWLAA GLAVPLSAQL
FCSPVIVLLQ PQFSSYALLA NMVAAPLVAP VTILGTAAVP VVPLAPWAAA VPMAVAGGCA
AGVAAVARFF ADLPGAALPW PEGPFGAATM VALSGCTLLV LWLVLHPRAL WTLVLATHRK
TVDLLDLWPR FTGLDERRHR GSLRGINPMS GRNQEWPLRK KHDPSRRRQR PPPGAT