Gene Arth_4033 details


Gene Information

Locus tag: Arth_4033
Symbol:
ID: 4447869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4552374
End bp: 4553816
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 67%
IMG OID: 639691864
Product: adenylosuccinate lyase
Protein accession: YP_833508
Protein GI: 116672575
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
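
The start, end, and length fields above are consistent with each other if the coordinates are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). The following minimal sketch checks that arithmetic. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Coordinates copied from the record above.
START_BP = 4552374
END_BP = 4553816


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1443
print(protein_length(length))  # 480
```

Both values match the Gene Length and Protein Length fields of the record.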


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGAAA CTGCCGCCAC AGCTGAGACC CGTACGCCTT CCGGACGCCT GGCCCTTGCC 
GCCTCGCCGG ACAAGATCGC CCTAGGCCCG CTGGACGGCC GGTACCAGTC CGCGGTCGCA
CCGTTGGTGG ATTACCTGTC CGAGGCGGCC CTCAACCGGG ATCGCGTGGC CGTGGAAGTC
GAGTGGCTCA TCCACCTGAC CAGCAACAGC GTCCTCCCGG GCGCCGGCCC GCTGACGCCC
GAACAGCAGG ACCAGCTCCG CGCCATCGTC ACGGAATTCG ACTCCGCGTC GGTCACCGAG
CTGGCCGACA TCGAGGCCGT AACGGTTCAC GACGTCAAGG CAGTCGAGTA CTACATCGGC
CGCAGGCTGC CGGCCATCGG CATTGAGCGG CTTACCGCCA TGGTGCACTT CGGCTGCACC
TCGGAAGACA TCAACAACCT CTCCTACGCG CTGGGCGTCA AGGGCGCCGT GGAGGACGTG
TGGCTGCCCG CCGCCAAGGC GCTGGTGGCC CAGATCAGCA GGATGGCTGA CGACAACCGC
AGCGTGCCCA TGCTTTCCCG CACGCACGGA CAGCCGGCCA CGCCCACCAC CCTGGGCAAG
GAACTGGCCG TCATCGCGCA CCGCCTGACC CGCCAGCTGG ACCGGATTGC CAGGACGGAA
TACCTGGGCA AAATCAACGG CGCCACCGGC ACCTACGCCG CCCACGTCGC TTCCGTTCCC
GGCGCGGACT GGCAGCACGT GGCGAAGTCC TTCGTTGAGG GCCTGGGCCT GACCTGGAAT
CCGCTGACCA CCCAGATCGA AAGCCACGAC TGGCAGGCGG AGTTGTACGC CGACGTCGCG
CGGTTCAACC GGATCCTGCA CAACGTGTGC ACCGACATCT GGAGCTACAT CTCCATCGGC
TACTTCGCGC AGATCCCGGT GGCGGGCGCC ACGGGTTCCT CCACCATGCC GCACAAGGTC
AACCCGATCC GCTTTGAGAA CGCCGAAGCC AACCTGGAGA TCTCCTCCGG CCTGCTGGAC
GTGCTGGGCT CCACGCTGGT CACCTCGCGC TGGCAGCGCG ACCTCACCGA CTCCTCCAGC
CAGCGCAACA TCGGCGTGGC CTTCGGGCAC TCCCTGCTGG CCATCTCGAA TGTGGTCAAG
GGCCTGGAGC GCCTGGACGT AGCCGAGGAC GTCCTGGCGG GCGACCTCGA CACCAACTGG
GAAGTTCTGG GCGAGGCCAT CCAAATGGTG ATGCGCGCCG AGGCGATTGC CGGCGTCGAA
GGAATGGAAA ACCCCTACGA GCGGCTCAAG GACCTGACCC GCGGACAGCG CGTGGATGCC
GCCCGGATGC AGGAATTCGT CCAGGGCCTG GGCCTCTCCG CGGACGCCGA AGCCCGGCTG
CTGGCCCTGA CACCGGGCAA GTACACAGGC ATCGCGGACC AGCTGGTGGA CCACCTCAAA
TGA
 
Protein sequence
MPETAATAET RTPSGRLALA ASPDKIALGP LDGRYQSAVA PLVDYLSEAA LNRDRVAVEV 
EWLIHLTSNS VLPGAGPLTP EQQDQLRAIV TEFDSASVTE LADIEAVTVH DVKAVEYYIG
RRLPAIGIER LTAMVHFGCT SEDINNLSYA LGVKGAVEDV WLPAAKALVA QISRMADDNR
SVPMLSRTHG QPATPTTLGK ELAVIAHRLT RQLDRIARTE YLGKINGATG TYAAHVASVP
GADWQHVAKS FVEGLGLTWN PLTTQIESHD WQAELYADVA RFNRILHNVC TDIWSYISIG
YFAQIPVAGA TGSSTMPHKV NPIRFENAEA NLEISSGLLD VLGSTLVTSR WQRDLTDSSS
QRNIGVAFGH SLLAISNVVK GLERLDVAED VLAGDLDTNW EVLGEAIQMV MRAEAIAGVE
GMENPYERLK DLTRGQRVDA ARMQEFVQGL GLSADAEARL LALTPGKYTG IADQLVDHLK
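
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is one possible way to do this in plain Python and is not the database's own method. It assumes the gene sequence is pasted as text with the 10-base spacing shown above. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers. The codon-to-amino-acid assignment used here is shared by NCBI translation tables 1 and 11; the sketch does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGCCTGAAA CTGCCGCCAC AGCTGAGACC CGTACGCCTT CCGGACGCCT GGCCCTTGCC"
print(translate(first_line))  # MPETAATAETRTPSGRLALA
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.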