Gene Haur_3648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3648 
Symbol 
ID5735509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4588222 
End bp4589397 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content53% 
IMG OID641280797 
ProductNLP/P60 protein 
Protein accessionYP_001546412 
Protein GI159900165 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGAT TGCCAATCCT CGCACGGGAT CGTCGGCGCT TGCAGATCGC TGGGTTATTG 
CTTGGCGCTG GGTTAACTGT GCCGTTGCTG CTTTGGTGGA CGTTTCCCGC AACCTCTCCC
GCGCCGCTTG GGGCAACGGC CACACCGCAA ATTGTGCGCG TCGGCGCAAC CCTCGATGAG
CCAATTCCTG CTGATTGGCA AGCCCCCGTT TTAAACCCCG AACTCGAAGC CCAGGCCTTG
GCTATCCCTG AAACCTTGCC GTTAACCGCT AGTGCGCTAT TGTATGATTC GATCTCTGCT
GATACGGTCT TTACTACAAC AATTGCTGGG TTGGTTGCCA GCGAGGAATT GAATTTGCGC
GATGGCCCGA GCGTTGATTA TTTGCCCATG GCGATTTTGC TCAACACCAC GCCGTTGACA
GTAGTTGGCC GATTTGAGGG CTGGCTGCAA GTTGTAACCC CGCAACGAGC GCTTGGTTGG
GTTGATGATA GTTATGTGGC CTTGGCCAGT TCAGCCCAAA CCCTGCCCCA AGTTAATCTG
CATGCCGACC CAAATCCAGT TTTAGTGGCG GGATTAACGG TTGAACGAGC TAATGTTCGC
TCGAAGCCGC AAACTGAAGC TGAAATTATC ACGACCTTGA GCGCTGAGCA TGGGCAAGTC
AATTTATTGC AACAACGTGA GGGTTGGTTC AATGTGCGCA CCAACGATGG CACTGAGGGC
TGGGTTTCCG CCGAACTGTT ACAAGCCGAT GCCTATATTT TGCGGCGTGT GCCAACCTTG
AGTGCCTCGC CCAACGCGCT TGAAGCGGTG CGTTTGGCCC GCAAATATGT AGGCTATCCC
TATGTTTGGG GCGGCGAAAC TCCGCGCGGT GGCTTCGATT GCTCAGGCTT GGTGCTGTAT
GTTTATGGCA AATTAGGCAT CGATATGCCC CATAGCGCCG CCGAACAATG GACTGGTGGT
TATGGCGAGA AAGTTGCTAG TCGCCGCGAT TTAGTGCCTG GCGATATTGT TTTTTTCAAA
AATACCTATA AAAAAGGCGT GAGCCATGTG GGCATTTATG CTGGCAATGG CAAAGTGATT
CAGGCGCTCT CCGAGAGTTT AGGCATTCGC GTTTCCGATT TATCCAATAG CTATTGGAGC
AGCCGCTATG TTGGGGCAAT TCGGCCATTT CCCTAG
 
Protein sequence
MPRLPILARD RRRLQIAGLL LGAGLTVPLL LWWTFPATSP APLGATATPQ IVRVGATLDE 
PIPADWQAPV LNPELEAQAL AIPETLPLTA SALLYDSISA DTVFTTTIAG LVASEELNLR
DGPSVDYLPM AILLNTTPLT VVGRFEGWLQ VVTPQRALGW VDDSYVALAS SAQTLPQVNL
HADPNPVLVA GLTVERANVR SKPQTEAEII TTLSAEHGQV NLLQQREGWF NVRTNDGTEG
WVSAELLQAD AYILRRVPTL SASPNALEAV RLARKYVGYP YVWGGETPRG GFDCSGLVLY
VYGKLGIDMP HSAAEQWTGG YGEKVASRRD LVPGDIVFFK NTYKKGVSHV GIYAGNGKVI
QALSESLGIR VSDLSNSYWS SRYVGAIRPF P