Gene Haur_4140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4140 
Symbol 
ID5736001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5288246 
End bp5289457 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content52% 
IMG OID641281294 
Productargininosuccinate synthase 
Protein accessionYP_001546900 
Protein GI159900653 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAATG CGACTGCAAA TAAAGTTGTG CTGGCCTACT CTGGTGGTTT GGATACCAGC 
GTGATCGTGC CGTGGTTGAA GGAACACTAT GGCTGTGAAG TCGTCTGCTA TTGCGCCAAT
CTTGGCCAAG ATGACGATCT GAGCGGCGTA GAGGCCAAAG CGATTGCTTC GGGCGCAAGT
GCCTGCTATG TCGAAGATTT GCGCGAAGAA TTTGTGCGCG ATTTCTTGTT CCCGATGTTG
CAATCAGGTG CAACCTACGA ACGCACCTAC TTGCTCGGCA CAAGCATCGC CCGCCCATTG
ATCGCTCGTG GTCAGGTGCA AACCGCCTTG AAAGTTGGCG CTGATGCACT TTCTCACGGC
TGCACGGGCA AAGGCAACGA TCAAGTGCGC TTTGAATTGA CCTACATGGC CTTGGCTCCG
CATATGAAAA TTATCGCCCC ATGGCGTGAA TGGCATATTC GCTCACGGGA AGATGCATTG
GATTATGCTG CATTACATAA TGTGCCAGTC ACCAGCACTC GCGCCTCGAT CTACAGTCGC
GATGGCAATA TTTGGCACTT GAGCCATGAA GGTGGCTCGT TGGAAGATCC ATGGTTAGAA
CCAGAATTGA CCATGTTCCA ACGCACCGTG ACCCCTGAAG AAGCTCCCGA TTTACCAGAA
TACTTGGAAA TTGGCTTCGA ACGTGGTATT CCGGTGAGTG TGAATGGCGA GCAACTTGGC
CCCGTGGCGT TGCTGCAAAC CCTTAACGAC ATTGGCGCTC GTCATGGCGT TGGTCGCGTC
GATTTGGTCG AAAATCGCTT GGTTGGCATG AAAAGCCATG GCGTATACGA AACACCAGGC
GGTACATTGC TCTATCGTGC CCACCAAGGC CTCGAAGAAT TGGCGCTTGA TCGCGAAACC
TTGCACTTCA AAGATACCTT GGCAATCCGC TTCTCAGAGT TGATTTACAA CGGTCAATGG
TGGTCGCCGT TGCGCTATGC CTTGTCGGCC TTCTTCACTG AAACCCAAAA GAACGTTACA
GGCGTGACTC GCTTGAAAGT CTTCAAAGGT GGCGTGTATT TGGTTGGGCG CAAGGCTGAA
CGCAGCTTGT ACGTACCCGA TTTGGCAACC TTCAGCGAAG ATGCCGTTTA TAACCAAGCC
GATGCTGAAG GCTTTATCAA GCTCTTTGGC TTGCCTCAAA AAGTCGAAGC CTTAACCTCA
GGAGCCGAAT AA
 
Protein sequence
MSNATANKVV LAYSGGLDTS VIVPWLKEHY GCEVVCYCAN LGQDDDLSGV EAKAIASGAS 
ACYVEDLREE FVRDFLFPML QSGATYERTY LLGTSIARPL IARGQVQTAL KVGADALSHG
CTGKGNDQVR FELTYMALAP HMKIIAPWRE WHIRSREDAL DYAALHNVPV TSTRASIYSR
DGNIWHLSHE GGSLEDPWLE PELTMFQRTV TPEEAPDLPE YLEIGFERGI PVSVNGEQLG
PVALLQTLND IGARHGVGRV DLVENRLVGM KSHGVYETPG GTLLYRAHQG LEELALDRET
LHFKDTLAIR FSELIYNGQW WSPLRYALSA FFTETQKNVT GVTRLKVFKG GVYLVGRKAE
RSLYVPDLAT FSEDAVYNQA DAEGFIKLFG LPQKVEALTS GAE