Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4384 |
Symbol | |
ID | 5736234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5599779 |
End bp | 5601449 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281546 |
Product | NLP/P60 protein |
Protein accession | YP_001547144 |
Protein GI | 159900897 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000240419 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATG CTAAGTCCTC GAATGATCTG ATCCAAGAAG ATCATGATCT GCAAGACTTC CAGCAGCGGG CTGTGAACAA CGAAAATTCG CTTTTACGAA CGACCAGTTT ACGCCAGCGC GTTAACGCCC GTTCGGTTTC TGCACTTCGT GCGGTTCCAT CGTACCTGCG CAAAGCTCCA CGGCGCTACC TACTTCATCT TATGGTGCTT TCGCTCTTAC CAGTTGGTTT AGTTGTCAAC AAAGATGCCA CTAAGCCTCA GGTAGATACT GCACTCTTGG TCAGCGCATC GCCAACTGCT GAACGCCAAG TTCGGCCAGC ACTCGGCTTG ATGACCATGA CCCACCGCAA CGAACCAGCT CCGTTGACTG CTCTCAATGA TTCAGAAGCT ACGCCTGACC CAGGTGTTGG CGATGGGCCA ATTAGCAGCC CCGATTTTGA TGATTCGTTG GTGATTCCAG TAGGCCGCCC AGTTAATAAC AACCCGACCT ATCCCGAATC AGTAGTGAGC GCCGATATTG CCAACTTGCG CAATGGCCCA AGCACTGAAT TCGATCGTCT CGATAAATTA GAACCAGGCA CCAAGGTAAC CGTTGTGGCT CGCCACGCCG ATTGGGTGCA AGTGCGCACC GAAGGCGGCC AAGAAGGTTG GCTCGCCGCT GATTTGCTTG ATTTAGAGCA ATCGGTGATC GATGCTTTGC CTGATGCCCA AAATATTCCA ACCCCACCAC CAGCCAAAGT GGGCAAGATC ACCCAAGATA ATTTGAACTT GCGTGATGGC CCTGGTACTG ACTACATCAG CATGAAAAAG CTGGGAATTG ATAGCCAAGT TTCATTGTTG GCCCGCTATC AAGGCTGGTA TCAAATTGAA ACTGGTGAAG GTAATGTGGG TTGGGTTTCA GCCGAATTCT TGAATCTTGA AGCTGGCGTT GCCGAACGGA TCGCCGAAGC TGAATCGATT CCATCAGCTA ACCCCGATTT GGTCGGTTGG GCAACTGATG AAGGCATCAA CTTGCGCTCT GGCCCAAGCA CCAAATTCGA TTCATTGGGC AAACTGAGCA AAGGCGCTGA ATTAACCTTA TTAGCTCGTT ACAAAGAATG GGTCAAGGTT CAAACCGCCA AAGGTACCAA AGGCTGGATC TCACAAGATT TAGTTGATGT CAGCAACTTT GTGTTCCGCC GTGTACCATT CACAACCAAT GTGCCCTCAT TACCAGTTGC TCCAGCTGCC CCCAAAAAGA GCACTCCTAG CCAACCTGCT GGTGGCGGTG GCGGCGGTGG TGGTACTGCT AGCGGCGACG TTGCTTCGAT GGCTTGGGCC TATGTTGGCT ACAACTATCG CTGGGGCGGC GAAAGCCCAA GCAGCGGCTT CGATTGCAGC GGCTTGACCA AGTATTTGTA TCGCCAAGTT GGGGTCAGCT TGCCCCACAG TGCCGCTGGC CAATATAGCA GCGCTTATGG CACCTTCATC GGAAGCATGA GCAACTTGCA ACCAGGCGAT TTGCTGTTCT ATGCTGGCAC TGCTGGCCCG GGCATCACCC ACGTAGGCAT CTACGTTGGC GGTGGTGTGA TGGTCAATGC GATGACTCCC GCTTCGGGGG TTGGTGCAGT CAGCATCTAT AGCAGCTACT GGCTCAATCA CTATTACGGC GCGTTGCGGC CTTATCGCTA G
|
Protein sequence | MQDAKSSNDL IQEDHDLQDF QQRAVNNENS LLRTTSLRQR VNARSVSALR AVPSYLRKAP RRYLLHLMVL SLLPVGLVVN KDATKPQVDT ALLVSASPTA ERQVRPALGL MTMTHRNEPA PLTALNDSEA TPDPGVGDGP ISSPDFDDSL VIPVGRPVNN NPTYPESVVS ADIANLRNGP STEFDRLDKL EPGTKVTVVA RHADWVQVRT EGGQEGWLAA DLLDLEQSVI DALPDAQNIP TPPPAKVGKI TQDNLNLRDG PGTDYISMKK LGIDSQVSLL ARYQGWYQIE TGEGNVGWVS AEFLNLEAGV AERIAEAESI PSANPDLVGW ATDEGINLRS GPSTKFDSLG KLSKGAELTL LARYKEWVKV QTAKGTKGWI SQDLVDVSNF VFRRVPFTTN VPSLPVAPAA PKKSTPSQPA GGGGGGGGTA SGDVASMAWA YVGYNYRWGG ESPSSGFDCS GLTKYLYRQV GVSLPHSAAG QYSSAYGTFI GSMSNLQPGD LLFYAGTAGP GITHVGIYVG GGVMVNAMTP ASGVGAVSIY SSYWLNHYYG ALRPYR
|
| |