Gene Haur_1651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1651 
Symbol 
ID5733535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1918183 
End bp1919148 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content53% 
IMG OID641278790 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001544422 
Protein GI159898175 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3591] V8-like Glu-specific endopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000137982 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTAT CTGTTCGGCG GTACGCACGG GTCGCGGTGG TTTTGAGCGT TTTAGGCTTG 
AGCTTGGCCC AAGGCGATGT GACCTTAGCC AAGAAGGAAG TCGAAGCTGG GGTTGACCCA
CATACCATCG TGGCTAGTGA TGGGAAGCCT GTAGACGTTA TTACTGATGG CATTGTTTAT
GATCAATTTG GGGCCTATAT CGCTGGTAAC GAAGGAACTG GTGTTTTAGC CGAAGATGAT
CGAGCCGAGG CCGTTGATCC CAATGCCTTG CCTGCAACCC AAAGTGGCAG CCAAGAAGGC
TTAAATTCAG TGATTGGTAC TGATAATCGC GTGCGAATCA CCGCGACGAC CTCAGACCCC
TATCGCCGGA TCGGCCAAAT TACCTTTAGC AGTGGCGGCG GCAACTACAT TTGTACTGGT
TGGTTGATCA GCGCCAACAC CGTGGCAACC GCAGGCCACT GTCTCTGGAG CAACAACGCT
TGGGTCACCA ACGTTAAGTT CTACCCAGGT CGCAATGGCA CATCGAACCC TTACGGCGGC
TGTAACGCCA CCAAACTCTT TACGGTTTCA CAATGGCAAA CCAGTGGCAG CCCCAACTAC
GATTATGGTG CATTCAAAAT TAATTGTAGT GTTGGCAGCC AAACTGGCTG GTTCGGCTTG
CGTGCCCCAA GCAACACCGG CTTAGTTGGC CAAGTAACCA ACATTGCTGG CTACCCAGGC
GATAAAACCT CGGGCACGAT GTGGTTCCAC GCCGATACCG TGCGCAGCTA CACCAGCCTA
CGACTCTCGT ATGCCAACGA CACCTATGGC GGCCAAAGTG GCTCACCAAT TTGGAACAGC
AGCGGCAGCT GCACCAACTG TTCGATTGGC GTACACACCA ATGGCGGTAC CACCACCAAC
TCTGGTACAC GCATCACCTC AACCGTCTTG AGCAACTTCA ACACCTGGAT CAATACTGCC
CCATAA
 
Protein sequence
MHLSVRRYAR VAVVLSVLGL SLAQGDVTLA KKEVEAGVDP HTIVASDGKP VDVITDGIVY 
DQFGAYIAGN EGTGVLAEDD RAEAVDPNAL PATQSGSQEG LNSVIGTDNR VRITATTSDP
YRRIGQITFS SGGGNYICTG WLISANTVAT AGHCLWSNNA WVTNVKFYPG RNGTSNPYGG
CNATKLFTVS QWQTSGSPNY DYGAFKINCS VGSQTGWFGL RAPSNTGLVG QVTNIAGYPG
DKTSGTMWFH ADTVRSYTSL RLSYANDTYG GQSGSPIWNS SGSCTNCSIG VHTNGGTTTN
SGTRITSTVL SNFNTWINTA P