Gene Arth_3850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3850 
Symbol 
ID4447549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4331785 
End bp4332723 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content69% 
IMG OID639691674 
Product4-amino-4-deoxychorismate lyase 
Protein accessionYP_833325 
Protein GI116672392 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCCTG CTGCCCCGGC CACCGTTCTC GTATTCCTTG ATCCCCGGTT CGACGACGGC 
CGGATTGCCG ATGCTTCGCA GCCGCAGCTG ATGGCCACGG ACCAAGGTGC CACGCGCGGC
GACGGCGTGT TTGAGTCCAT GCTGGCCGTG GGCGGAAACC CGCGGAAGCT GGACGCCCAC
CTGCGGCGGC TGCAGGGTTC GGCCCGCGCG CTGGAGCTGG ACATCCCGGG CGAGGACACC
TGGCGCCGGG CCATAGCGAC GGCGGTGGCC GAATACCGTT CCCAGCATCC CGCCGGCACA
CCGGAGGAAG ACGAGACGGT GGTGAAGCTG ATCTGCACCC GCGGCGCCGA GGGCGGAGCA
CGCCCCACCT GCTGGGTCCA GGCTTCCCCC GTCCCCGCCG CCGGCCGGCG CCAGCGCGAG
ACGGGGATAG ACGTCATCCT CCTGGACCGC GGCTACGACA GTGAAGTGGG TGAACGGGCG
CCGTGGCTGC TGCTGGGCGC CAAGACGCTC TCCTACGCGG TGAACATGGC GGCCCTGCGG
TACGCCCACA ACCAGGGGGC GGACGACGTC ATCTTCACCT CGTCCGACGG ACGGGTCCTC
GAAGGCCCCA CGTCCACCGT CCTGCTGGCG CACCTTGACA CAGTCGACGA CGGCGGCACC
CGCACGGTGC GCCGCCTCAT CACGCCGCAG CTGGACAGCG GCATCCTCCC GGGAACGTCT
CAGGGCGCGC TCTTTGCTGC CGCCAAAGCT GCGGGCTGGG AACTCGGCTA CGGCCCGCTG
GAGCCGAGGG ACCTTTTCGA CGCCGACGCC GTGTGGCTGA TTTCCAGCAT CCGGCTGCTG
GCTCCCGTGA ACCATATCGA CGGCAAGGAA ATCGGCACGC CCGCCCTCCG GAAGCAGCTG
ACCGACGAGC TCAACCAGCT GTTCGCCACG ATCGAATAG
 
Protein sequence
MTPAAPATVL VFLDPRFDDG RIADASQPQL MATDQGATRG DGVFESMLAV GGNPRKLDAH 
LRRLQGSARA LELDIPGEDT WRRAIATAVA EYRSQHPAGT PEEDETVVKL ICTRGAEGGA
RPTCWVQASP VPAAGRRQRE TGIDVILLDR GYDSEVGERA PWLLLGAKTL SYAVNMAALR
YAHNQGADDV IFTSSDGRVL EGPTSTVLLA HLDTVDDGGT RTVRRLITPQ LDSGILPGTS
QGALFAAAKA AGWELGYGPL EPRDLFDADA VWLISSIRLL APVNHIDGKE IGTPALRKQL
TDELNQLFAT IE