Gene Haur_0343 details

Gene Information

Locus tag: Haur_0343
Symbol:
ID: 5732253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 409107
End bp: 411299
Gene Length: 2193 bp
Protein Length: 730 aa
Translation table: 11
GC content: 47%
IMG OID: 641277467
Product: proprotein convertase P
Protein accession: YP_001543123
Protein GI: 159896876
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
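
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the gene sequence in this record ends in TAA). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 409107
end_bp = 411299

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2193, matching "Gene Length"
print(protein_length)   # 730, matching "Protein Length"
```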


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000557566
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGATTAC GGATTGGTTT AATCATCACG CTTTTGTTGA TGATCACCGT TGGAAAAACC 
CTGCAAAACC CCAGGGATGT AGCCCAAGCC CTGTCCCAAA CCCCTGATCA AACCACAATT
CCGCAAGGAT TTTCCCGCAT CCAAGAGCAA GAATTTAACG ACTTACCACT CAATGCCAAT
CTGTTGGTTG GCAGCAGTTT GGTGGTGCGC GGCTCAATTC AGCAAACTGA TGTTGATTAT
TTTGCGCTTG ATTTAACCGC TGGTCAGCGT TTAGCAATGG CCACCATTAC TAGCGCCAGC
GTCGCCTCCA GCGATACAAC CATCCATCTC TATGCTTTCG ATGGCGTAAA TAGTGATCTG
ATCGAAACCG ATTTGAGCGA TGGCATTCTG AGCAACAGTT CCTCGGTGAT CAGTTCTCAG
CCAATTACGC TGACCGGAAC CTACCTGATT AAAGTTGAAG GTGGCACTGC AACCACGGTT
ATTCAGCCCT ACGACTTATA TGTGCGAGTG CTGAGCGAAA CTGCCAGCGA ACAAGAACCT
AACGACGAAG TTGCCCAAAC CATCGATGCT CAAGCGGCAA TCAGCGGGGT AGTTTCAACC
ACCAACGATC TTGATCGCTA TCAATTTAAT GTTAATCCTG GCGACACAAT TTTTGCCACC
GTTGATTTTG ATCCAGAGCG TGATGGCATA ACTTGGAATG GCTTTCTCGA TATTGGCATG
ATCAACAATA CCTATCTTCG GGCTAACGAT AGTAATAGTG TTTCACCCAA CGCTGAAGCC
AATGTGATCA CCGTTCAACA AGCTGGCACT TACGAAATTC GCATTGGCTC ACTACTTGAG
ATCGGTGTGG ATGCTAGTTA TTTGGCGCAA GTTACGATTA TTCCCGCCGC TATTCAGGCC
AACTGCCAAA CCGTGATGAG TTCAGGTGCA CCACAGAATA TTGGCCCACA AGCTGGAATA
ATTCAATCGA CATTAACCGT CACCCAAGCA GCCAACATTG CCGATATTGA TGTATTGCTC
AATTTAGAGC ATAGCTTCAT GCCCGATTTG GATGTAACCC TGACTGCCCC CGATGGTAAT
GTGATTAACC TCTTCACCGA TATTGGCAAT GTGCAGCAAC CTACCGTTAA TCTCGTGATT
GATCAACAAG CTGCCTTACC ACTTGGCACG TATAACGTTT TGAGTGGCAC GCATTTTGGT
CCGAAATGGA ATAGCAGCCT CGATTGGTTG GCTGGACAAC AAGCCCAAGG CCAATGGATA
CTGACGATTT ATGATGATAC CGACCAAAAT GCTGGTGTAT TAAATGGTTG GGGCTTGCGG
ATTTGTGGCA TGCCCACGCC AAGCGATTGT CCAGTCGGGA TGTCGCGCAG CGTGCTTTAT
AGCAGCCAAT TTGAGGCCGA TAATGGCGGG CTTACTCCAG GCCCATTTGA CCAAGAATGG
GTTTGGGGTA ATCGTAATAG CCCACCAATT GTTGGTGCGT ATAGCGGCGA AAATAGCTGG
AATACCAATT TAACCGGAAA TTATCCTAAT AGTACCCGCA TGCAATTGCT ATCACCGCAA
ATTGACTTAA CCAATGTTAC GGGGCCAATT TATGCAAGTT GGTATCAACG CTATCAGCTT
GATAATAGTG TTAACGATTT TTATCAGGTA ACGGCCCATA AGCCCCAAGT TGAACAGATT
CTCTTTCGCC ATCAAAGTGC TGCAATGCAA ATTAACCTCG GAAATCCTTT GGTTACGCTT
GATCAAAGTA CAGGTTGGGG ACTTCAACGC CATGATCTAA GCGATTTTGC TGGCACTTCA
CTCTATTTAA CCTGGGATTT CGGTAGTGAT GAGGTAGCAA GTTTTGCTGG GATTGCGCTT
GATGATGTGG AAATTACTGG TTGTATTGAT CCAGCTCAAA TCACGCCAAC GAATACGCCA
ACGCCAAGCA ACACGCCAAC CCTAACATCA ACTCCTAGCA ATACGCCAAC GCCAAGCAAT
ACGCCAACGG CGACCGCAAC ACCAACCCAA ACCTTAACGC CAACTGAAAC GCCAACGCCA
AGCAATACGC CAACGGCGAC CGCAACGCCG ACCCAGACCG AAACGCCAAC CGCGACTGAA
ACACCAAGTA TCACCCCAAC GAGTACACCA AGTGTAACGG CAGATCCAAG CTTAATCCCG
GTCTATCTGC CTTTAGTCAG TAAAGATAAT TAA
 
Protein sequence
MRLRIGLIIT LLLMITVGKT LQNPRDVAQA LSQTPDQTTI PQGFSRIQEQ EFNDLPLNAN 
LLVGSSLVVR GSIQQTDVDY FALDLTAGQR LAMATITSAS VASSDTTIHL YAFDGVNSDL
IETDLSDGIL SNSSSVISSQ PITLTGTYLI KVEGGTATTV IQPYDLYVRV LSETASEQEP
NDEVAQTIDA QAAISGVVST TNDLDRYQFN VNPGDTIFAT VDFDPERDGI TWNGFLDIGM
INNTYLRAND SNSVSPNAEA NVITVQQAGT YEIRIGSLLE IGVDASYLAQ VTIIPAAIQA
NCQTVMSSGA PQNIGPQAGI IQSTLTVTQA ANIADIDVLL NLEHSFMPDL DVTLTAPDGN
VINLFTDIGN VQQPTVNLVI DQQAALPLGT YNVLSGTHFG PKWNSSLDWL AGQQAQGQWI
LTIYDDTDQN AGVLNGWGLR ICGMPTPSDC PVGMSRSVLY SSQFEADNGG LTPGPFDQEW
VWGNRNSPPI VGAYSGENSW NTNLTGNYPN STRMQLLSPQ IDLTNVTGPI YASWYQRYQL
DNSVNDFYQV TAHKPQVEQI LFRHQSAAMQ INLGNPLVTL DQSTGWGLQR HDLSDFAGTS
LYLTWDFGSD EVASFAGIAL DDVEITGCID PAQITPTNTP TPSNTPTLTS TPSNTPTPSN
TPTATATPTQ TLTPTETPTP SNTPTATATP TQTETPTATE TPSITPTSTP SVTADPSLIP
VYLPLVSKDN
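
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the tool the database used. It relies on the fact that table 11 assigns the same amino acid to each codon as the standard code, and it does not handle alternative start codons such as GTG or TTG (this gene starts with ATG, so that case does not arise here). `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon order used by NCBI genetic code tables: T, C, A, G at each position,
# with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCGATTAC GGATTGGTTT AATCATCACG"))  # MRLRIGLIIT

# The last line of the gene sequence is in frame and ends with the TAA stop.
print(translate("GTCTATCTGC CTTTAGTCAG TAAAGATAAT TAA"))  # VYLPLVSKDN
```

Passing the full gene sequence to `translate` in the same way should reproduce the protein sequence listed above.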