Gene Haur_4138 details

Gene Information

Locus tag: Haur_4138
Symbol:
ID: 5735999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5285338
End bp: 5286702
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 54%
IMG OID: 641281292
Product: argininosuccinate lyase
Protein accession: YP_001546898
Protein GI: 159900651
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
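
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch spells out. It assumes the Start bp and End bp values are 1-based and inclusive (this is consistent with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names gene_length and protein_length are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(5285338, 5286702)
    print(n, protein_length(n))  # prints: 1365 454
```

Both results agree with the Gene Length and Protein Length fields of this record.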


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGGGTG GACGATTTAG TGGCTCGTTG GCTGAGCATA TGCGCTTATT CAACGATTCG 
TTTCCAATCG ATCGGCGCTT ATGGGCCGAG GATATTCGCG GCTCAATTGC TTGGGCCAAT
GGCTTAGAAC GCGCTGGCAT TTTGCAAGCC GCCGAATGCC AAGAACTGAT CGCAGGCTTG
CGCCAAGTCT ATCAAGAGTT TGAAGAAGGC CTGTTTTTGC CATTAACCAG CGACGAAGAT
ATTCACACGG CGGTCGAACG CCGCTTGGGT GATTTTATCG GAGCCTTGGC GGGCAAATTG
CATACTGGCC GCTCCCGCAA CGATCAAGTG GCGACCGATA CCCGTTTGTG GACATTGGGC
GCACTCAAAT TGGCCGATGA TCTTATTCGT GATGTGCAAG CGGCCTTGCT GGAACAAGCC
AAAGCGGTTG GCGAAGCAAT GTTACCTGGC TATACCCACC TGCAACGGGC GCAGCCAGTC
TTGTTATCGC ACGCACTTTT AGCCCATTTT TGGCGTTTAG ATCGCGATCG CCAACGCTTG
CACGATGCAA CCAAGCGCGT CAGCGTACTG CCGTTAGGCT CAGGCGCACT GGCTGGTACA
GCCTTCGCGG TTGATCGGGC GGCACTGGCA GCCGAATTAG GCTTTACCAG CATCAGCCAA
AATAGCCTCG ATGCTACCAG TGATCGCGAT TATATTGTCG AAATTTTGGC GGCAATCGCG
CTGTTGGGTG TCCACATTAG CCAGCTCGCC GAGGATTGGA TCATCTGGAG CAGCTCGGAA
TGGGGTTTTG TCGCGCTAGA CGATGCCTAT TCAACTGGCT CAAGTTTAAT GCCGCAGAAG
AAAAACCCTG ATTCGTTGGA GTTGGCTCGT GGCAAATCTG GCCGTTTGAT CGGCAATTTA
ATTACCGTTT TGACCTTGCT CAAAGGCCTG CCATCGGCCT ACGACAAAGA TTTACAAGAA
GATAAAGCGC CCTTATTCGA TGCGATCGAT ACATTAAGCC TGACGTTACC AGTGGTTGCA
GGTGCAATTC GTACCGCTCG TTTCAACACC GAACGCATGG AATCGGCGCT TGATGATGCA
ATGCTGGCAA CCGATGTAGC CGATGAATTG GTACGCCGGG GAGTGCCATT CCGCGAGGCG
CATCATATTG CTGGGCGTTT GGTGCGCGAA GCCGAACAAC GTGGGGTTGG CATGCGCCAA
TTGCCTGCCG AAAGCTTCGT AGCCGCCCAC CCAAGCCTGA CCGATGTTGC TGGTTTATTC
GATTTTGCTC GTTCCGTCGC CATGCGCGAC GTACCTGGTG GAACAGCGCC CAACGCCGTG
CGCGACCAAC TGATTGCTGC TCAACACGTT TTAGCAGAAG GTTGA
 
Protein sequence
MWGGRFSGSL AEHMRLFNDS FPIDRRLWAE DIRGSIAWAN GLERAGILQA AECQELIAGL 
RQVYQEFEEG LFLPLTSDED IHTAVERRLG DFIGALAGKL HTGRSRNDQV ATDTRLWTLG
ALKLADDLIR DVQAALLEQA KAVGEAMLPG YTHLQRAQPV LLSHALLAHF WRLDRDRQRL
HDATKRVSVL PLGSGALAGT AFAVDRAALA AELGFTSISQ NSLDATSDRD YIVEILAAIA
LLGVHISQLA EDWIIWSSSE WGFVALDDAY STGSSLMPQK KNPDSLELAR GKSGRLIGNL
ITVLTLLKGL PSAYDKDLQE DKAPLFDAID TLSLTLPVVA GAIRTARFNT ERMESALDDA
MLATDVADEL VRRGVPFREA HHIAGRLVRE AEQRGVGMRQ LPAESFVAAH PSLTDVAGLF
DFARSVAMRD VPGGTAPNAV RDQLIAAQHV LAEG
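
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a block could be processed, the Python below removes the spacing, computes GC content, and translates codons to amino acids. The helper names (clean, gc_fraction, translate) are hypothetical. The codon table is the standard one built by hand; translation table 11 uses the same codon-to-amino-acid assignments and differs only in its set of permitted start codons, so the sketch makes no attempt to model alternative starts. Only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = (
        "ATGTGGGGTG GACGATTTAG TGGCTCGTTG GCTGAGCATA TGCGCTTATT CAACGATTCG"
    )
    dna = clean(first_line)
    print(len(dna))          # prints: 60
    print(translate(dna))    # prints: MWGGRFSGSLAEHMRLFNDS
    print(f"{gc_fraction(dna):.0%}")
```

The translated residues match the first twenty residues of the protein sequence listed in this record. Running the same functions over the full gene sequence would be the natural way to check the Gene Length, Protein Length, and GC content fields.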