Gene Haur_2217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2217 
Symbol 
ID5734104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2816608 
End bp2817996 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content50% 
IMG OID641279358 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001544985 
Protein GI159898738 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTTTC CACCGACCAA TCAGCATGGG CCGCTAGCCC GCTGTATTGT GGTCAAATTC 
CACGATCAGC AGGCTCCGCT AATCTATACC GATCAAAATT TTCAAGGCTA TGTTGTTCAA
CCGCTCTTTA ACAGCGTCGA TCCTGATCAA TTAGTTGGCT TAGTTCAGCA AGCCCAGCAA
ACTAACCCAG AATATCAAGC ACCAAATTTT TTGGCCTTCT TTGCGATTGA TGTTGAGCCA
GACCAAGATC CCTTTGCAAT CATTGAGGAA ATTAACCAAT GGGCAGAACT TGAGTATGCC
TATGTTGAAT CGCCGCCAGC ACCTGTGCCC ACTGATGCTA ACCCTCGCCG CGCACGCCAA
ACCTATCTCA ACCCACCAAG CAGCACTGGC CCAACCATTG GCGGCATCGA TGCTGAAGCG
GCTTGGAAGG TTTTAGAGCA TGCAGGCAAA GCCATCACGA TTGTGGATAT TGAGAAATCA
TGGCAATTAG AACACCCCGA TTTGCTGCAA CATGGCAGTT CGCCGATTAC GATCTTGCCC
TCACTGCTGC ATTGTGATAT GCATGATCCC GCTTGTGCTG ATCATGGCAC GAATGTTTTA
GGTGTGCTGG TGGCCCAAAA TAATACTGAG GGTGGAGTTG GAATCGCTCA CGACGCAGCA
GCAGCAGTCA TTTCGCCCTG GCAAAAACCA AGCAATGGCA CCAACCAACC AAGCTGGAAT
ATCGCCAATG CGATTGTCGC CGCAAGCAAC TACTTAACAA CACTTGAATT ATCAGGTAAT
CTGATCCTGC TAGAATTGCA AATCTACCAA GATCTGGCTG GCGGACCCTA CACCAACACG
CCCAATCAAC CAGGCAGATT GCTACCAGTC GAATTAGAAC CAGCCAATTT CGAGGCCATT
CGGTTGGCGA GCGAACTGGG CATTATTGTG ATCGAGGCTG CTGGAAATGG CGCTAGCGAT
TTGGCAACGT GTTGGGATAC CGTGGGAACA TATCAAATTG AGCCAGAAAC AGCCCGTTAT
CGTGATTCTG GGGCAATTCT AGTTGGAGCG GTTTACAGTC GCGACCCCAA TAAAGCAACG
CGAACCGCTA GCTCCAACTA TGGACAACGG GTCAATTGCT TCGCTTGGGG CAATGGTGTA
TTTACCACCA ATGCTTCGGG CTATAGCTTA AGCTTTGGCG GAACATCAGC AGCAGCAGCG
ATTATTGCCG GAGCCGCCAT TTTAGCGCAA GCAATTGGCG AACAGCTGCG CCAAGCACGA
TTCAGCCCCG AGGAATTACG CAGGTTGCTT ACCCACCCAG ATGCCTGTAC CTATTCAGCC
CAGCCCCAAC ATGATCGGGT TGGCGTTATG CCAGATCTAG GGCGCATTAT TGGCTTGTTG
CAGGTTTAA
 
Protein sequence
MLFPPTNQHG PLARCIVVKF HDQQAPLIYT DQNFQGYVVQ PLFNSVDPDQ LVGLVQQAQQ 
TNPEYQAPNF LAFFAIDVEP DQDPFAIIEE INQWAELEYA YVESPPAPVP TDANPRRARQ
TYLNPPSSTG PTIGGIDAEA AWKVLEHAGK AITIVDIEKS WQLEHPDLLQ HGSSPITILP
SLLHCDMHDP ACADHGTNVL GVLVAQNNTE GGVGIAHDAA AAVISPWQKP SNGTNQPSWN
IANAIVAASN YLTTLELSGN LILLELQIYQ DLAGGPYTNT PNQPGRLLPV ELEPANFEAI
RLASELGIIV IEAAGNGASD LATCWDTVGT YQIEPETARY RDSGAILVGA VYSRDPNKAT
RTASSNYGQR VNCFAWGNGV FTTNASGYSL SFGGTSAAAA IIAGAAILAQ AIGEQLRQAR
FSPEELRRLL THPDACTYSA QPQHDRVGVM PDLGRIIGLL QV