Gene Arth_1007 details


Gene Information

Locus tag: Arth_1007
Symbol:
ID: 4446512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1085428
End bp: 1086621
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 62%
IMG OID: 639688812
Product: putative ATP-dependent DNA helicase
Protein accession: YP_830503
Protein GI: 116669570
COG category:
COG ID:
TIGRFAM ID:
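
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(1085428, 1086621)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1194 397
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields of the record.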


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCTAA GACTGGAAAC ACAAAATAAC CCACTCCCCA CACTATTGGA TCGGGCGGAC 
ATGTCAGAAG GGCGAGAGGG GAAAATGCGC TCCCTGGGTG GGGAGTTCCG GTCGGCAGAG
GGTGGGCTCG TCACCACCGC CACCGCCAGT GAGGGCCTAG CCCTGGACAC GCCCATCCCC
GCACCGAAGG GCTGGACCAC CGTGGGGAAG CTTGGGGCCG GGGACAAGAT CTACGGCACC
ACTGGGGAAC CTGTAACCGT TGTCGAGGCT TTCCCCGTAC AGACCAACCG GGACTGCTAT
CGTGTTACGT TCCGGGACGG AACCTCGCTC GTGGCCTCAG ACGGAAACCT GTGGCAGGCT
CGGCCCACGG GCTGGCCGGC ATCGCACAAC CGCGTCTGGA CAACGCGCCA GATGTATGAC
CACAGCGCGA AGCGGTGGAG CATACTAACC CCCGGGCCAC AGCAGGGGCC TACCCGCGAT
CTTCCGGTTG AACCTTACCT TCTCGGTTAC TGGCTGGGCG ACGGCAGTAC GGGAGCCTGC
AACATCACGG TCGGCGATGA AGACCTGGAA GTTTTCACTG CCAACATGGA TGCCATCGGG
GTAGAAGTCC ACCCAGTCGG AGCAAAAAAG GGCAATTGCA CCCGCATGTC TTTTTCGTCC
AAGGTTGGCT TTGGTGCTGA CATGGGCGGA ACCGACGCAC GGGCGCTTCG CAAGCTCGCC
TGCTTTCGCA ACAAGCACAT CCCCGAAGAG TACCTGGAGG GGTCGATCTC GCAGCGGACG
GCCCTGCTGC AGGGCCTGCT CGACTCGGAC GGCTGGGCGA GCGGGCGCGG GGTTGGCTTT
TGCGGGCGCG AACGACTGGT CAACGATGTC ATCAGACTTC TGCGGTCACT GGGAGAAAAG
CCCATGAGGA CCTTCGCCGC CCATGCTCAG TCCCGGGACG GTGGAACTTG GAGGATTCAC
TTTATTCCTC GGAACATCAC CGAATGTTTT CGCCTTCCGC GCAAACAGGA TCGGGTCAAT
CCGGCCAAAC GAACGACCAC GGCAATTGAA TCCATCGAAC CGGTCGGGTC CGTGCTGGTC
CGAGGCATCC GAGTGGATAC CAAGGACTCG CTCTTCCAGG CGGGCGCGGG ATGCCAGCTC
ACTCACAACA CACGCCAATT GCCCCCATTG CCAGCGCAGC ACGGGCTTTT TTAG
 
Protein sequence
MDLRLETQNN PLPTLLDRAD MSEGREGKMR SLGGEFRSAE GGLVTTATAS EGLALDTPIP 
APKGWTTVGK LGAGDKIYGT TGEPVTVVEA FPVQTNRDCY RVTFRDGTSL VASDGNLWQA
RPTGWPASHN RVWTTRQMYD HSAKRWSILT PGPQQGPTRD LPVEPYLLGY WLGDGSTGAC
NITVGDEDLE VFTANMDAIG VEVHPVGAKK GNCTRMSFSS KVGFGADMGG TDARALRKLA
CFRNKHIPEE YLEGSISQRT ALLQGLLDSD GWASGRGVGF CGRERLVNDV IRLLRSLGEK
PMRTFAAHAQ SRDGGTWRIH FIPRNITECF RLPRKQDRVN PAKRTTTAIE SIEPVGSVLV
RGIRVDTKDS LFQAGAGCQL THNTRQLPPL PAQHGLF
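
The sequences above are printed in blocks of 10 characters, so they need to be joined before any length or composition check. A minimal sketch, assuming the sequence is pasted in as plain text; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool:

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as example input.
first_line = "ATGGACCTAA GACTGGAAAC ACAAAATAAC CCACTCCCCA CACTATTGGA TCGGGCGGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing all lines of the gene sequence instead of just the first should give a cleaned length of 1194 bases, matching the Gene Length field, and a GC percentage that can be compared with the reported GC content.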