Gene Arth_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2031 
Symbol 
ID4445440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2290732 
End bp2291754 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content66% 
IMG OID639689839 
Producthypothetical protein 
Protein accessionYP_831511 
Protein GI116670578 
COG category[L] Replication, recombination and repair 
COG ID[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02778] DNA polymerase LigD, polymerase domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.232126 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGCG AATCCACCAC CCTCACCGTC GGGGGCCCGA ACGGCCCCCG CGACATGCGG 
ATCTCAAGTC CCGGCCGTGT CCTATGGCCG GATCTCGGGC TGACCAAGCT GGACCTTGCG
CGCTACCTCG CGGAGGTGGG TGACGCCTTT ATCGCGGCGA ACGGCGGACG TCCCGTTGCC
CTGCAGCGGT TTTCGGACAA CGTCGACGGC GAGCAGTTCT TCTCCAAAAA TCCACCCAAG
GGCACGCCGG ACTTCATCCG GTCGGTGAAG GTGGTCTTTC CGAGTGCGCG TTCTCACCCG
ATGCTGGTCC TGGACGAACC GGCCGCCGCC GTCTGGGCTG CCCAGATGAA CACCGTGGTG
TTCCACCCCT GGCCGTCGCG TGCCGAAAAC ACGGACAACC CGGACCAGTT GCGGATCGAC
CTGGACCCCC AGCCGGGAAC CGACTTCGAC GACGCCATCC CTGCAGCCCT GGAGCTGAAG
GAGGTGCTCG CGGAAGCCGG ACTCGCCACC TTTATCAAGA CCTCGGGGAA CCGCGGCCTC
CACGTCTATG CGCCGGTGGA GCCGGCTTTT GAGTTCCTGG ATGTCCGCCA CGCAGTCATC
GCCGCCGCCC GTGAACTGGA GCGGCGGATG CCGGACAAGG TCACCACGGC CTGGTGGAAG
GAAGAACGCG GCGAACGGGT GTTCGTGGAC TTCAACCAGG CAAACCGCGA CCGCACCATC
GCCGGCGCCT ACAGCCCCCG TGCACTGGGC CACGCCCCGG TGTCCTGCCC GATCACCTGG
GACGAACTGG GCAGTGCGGA CCCGAAGGAT TTCACCATTC TCACCGTCCC CGAACGGCTC
CGGACTGTCG GGGACCCGTG GGCGGACATG AACGCCAACC CGGGAAAAAT TGACGTGCTG
CTCGAGTGGT GGGAGCGCGA CGTCGGCTCC GGACTGGGGG AGCTTCCGTT CCCGCCGGAC
TACCCCAAGA TGCCAGGTGA ACCTCCGCGG GTTCAGCCCA GCAGGGCCCG CAAGAAGGAC
TAA
 
Protein sequence
MASESTTLTV GGPNGPRDMR ISSPGRVLWP DLGLTKLDLA RYLAEVGDAF IAANGGRPVA 
LQRFSDNVDG EQFFSKNPPK GTPDFIRSVK VVFPSARSHP MLVLDEPAAA VWAAQMNTVV
FHPWPSRAEN TDNPDQLRID LDPQPGTDFD DAIPAALELK EVLAEAGLAT FIKTSGNRGL
HVYAPVEPAF EFLDVRHAVI AAARELERRM PDKVTTAWWK EERGERVFVD FNQANRDRTI
AGAYSPRALG HAPVSCPITW DELGSADPKD FTILTVPERL RTVGDPWADM NANPGKIDVL
LEWWERDVGS GLGELPFPPD YPKMPGEPPR VQPSRARKKD