Gene Haur_4948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4948 
Symbol 
ID5736784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6275220 
End bp6276344 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content52% 
IMG OID641282115 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001547706 
Protein GI159901459 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.809867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCTTC GTGATCGTTT GGGTTGGATG CTGAGTGGTC TATTGCTGGG CATGCTATTG 
ATGGTTAGTT GCGATGTGGT CAATCAAGCT TCTACGGCTC AGCCAAGTGT GGTTGATGCT
GCCGCTGTGC CTTCAACTGG ACCATTGGCC ACCGCTGCTG CCCAAATGCC TGTTAATCAG
GCTGGCGTTG ATGCCTATAG CAATGTGATT CGGGCCGTTT ACAATCGCGG TAATCCTTCG
GTGGTGCGAA TTGACGTGCA AAGTGAGCAA GGCGAATCGC TCGGAACAGG TTTTGTGATC
GATAAACAGG GCCATATTGT CACCAACAAT CACGTTGTTG GCAGCAGCCG CAGCGTGTTA
GTCAATTTTA TCGATGGTGA TGCAGGGATC GCCGATGTGA TTGGCGTTGA TAGCGATTCA
GATTTGGCAG TAATTAAAAT GCGTAATCCT GATCCTGCTA TTCTGATTCC TGTTGAGTTT
GGCGATTCGG CGGCGGTGCA AGTTGGCGAT GTGGTGGTAG CGATTGGGAA TCCCTATGGT
GAAAATCGTA CCGCGACTGC GGGGATTATT AGTGCGATTC GTGGAGCCAA GAATGAGGGT
GGCGGCAGTA CCTTTTCAAT TCCTGGGGTG TTGCAAACCG ATGCGGCGAT TAACCCAGGC
AATTCGGGCG GGCCATTGTT CAACAGCCAA GGCCAAGTAA TTGGGGTCAA TACCTTTATT
CTCGACCCAT CGGGGCGGGG CGCGAATATT GGCTTGGGTT TTGCAGTGCC GATTAATTTG
GTTAAGCTGG TGGCCCCAGC GATTATTCGC GATGGCAGCT ATACGCATCC ATTCTTTGGC
GCGGCGGTAA GTAGCGTTGA TAGCTATTTT GCTGAAGTTA ATAATTTACC AAGCAAAGGC
ATTATTATTA CCCAACTCTA CAATGGCCCT GCTGCCGAGG CTGGCTTGCA AGTGGGCGAT
GTGATTGTCT CGGTTAATGG TGAGCCAATG CTTGAAGCTG GCGATCTGAT CACGCTCTTA
GAATTAACCA CCCAACCAGG TGATCGGATG ACGGTTACGG TGGCCGAGGG CAATGGCCGC
ACCCGTGATG TGCAAGTGCT GGTCGGGGCA CGTCCAGGTC GCTAA
 
Protein sequence
MQLRDRLGWM LSGLLLGMLL MVSCDVVNQA STAQPSVVDA AAVPSTGPLA TAAAQMPVNQ 
AGVDAYSNVI RAVYNRGNPS VVRIDVQSEQ GESLGTGFVI DKQGHIVTNN HVVGSSRSVL
VNFIDGDAGI ADVIGVDSDS DLAVIKMRNP DPAILIPVEF GDSAAVQVGD VVVAIGNPYG
ENRTATAGII SAIRGAKNEG GGSTFSIPGV LQTDAAINPG NSGGPLFNSQ GQVIGVNTFI
LDPSGRGANI GLGFAVPINL VKLVAPAIIR DGSYTHPFFG AAVSSVDSYF AEVNNLPSKG
IIITQLYNGP AAEAGLQVGD VIVSVNGEPM LEAGDLITLL ELTTQPGDRM TVTVAEGNGR
TRDVQVLVGA RPGR