Gene Haur_4384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4384 
Symbol 
ID5736234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5599779 
End bp5601449 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content52% 
IMG OID641281546 
ProductNLP/P60 protein 
Protein accessionYP_001547144 
Protein GI159900897 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000240419 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGATG CTAAGTCCTC GAATGATCTG ATCCAAGAAG ATCATGATCT GCAAGACTTC 
CAGCAGCGGG CTGTGAACAA CGAAAATTCG CTTTTACGAA CGACCAGTTT ACGCCAGCGC
GTTAACGCCC GTTCGGTTTC TGCACTTCGT GCGGTTCCAT CGTACCTGCG CAAAGCTCCA
CGGCGCTACC TACTTCATCT TATGGTGCTT TCGCTCTTAC CAGTTGGTTT AGTTGTCAAC
AAAGATGCCA CTAAGCCTCA GGTAGATACT GCACTCTTGG TCAGCGCATC GCCAACTGCT
GAACGCCAAG TTCGGCCAGC ACTCGGCTTG ATGACCATGA CCCACCGCAA CGAACCAGCT
CCGTTGACTG CTCTCAATGA TTCAGAAGCT ACGCCTGACC CAGGTGTTGG CGATGGGCCA
ATTAGCAGCC CCGATTTTGA TGATTCGTTG GTGATTCCAG TAGGCCGCCC AGTTAATAAC
AACCCGACCT ATCCCGAATC AGTAGTGAGC GCCGATATTG CCAACTTGCG CAATGGCCCA
AGCACTGAAT TCGATCGTCT CGATAAATTA GAACCAGGCA CCAAGGTAAC CGTTGTGGCT
CGCCACGCCG ATTGGGTGCA AGTGCGCACC GAAGGCGGCC AAGAAGGTTG GCTCGCCGCT
GATTTGCTTG ATTTAGAGCA ATCGGTGATC GATGCTTTGC CTGATGCCCA AAATATTCCA
ACCCCACCAC CAGCCAAAGT GGGCAAGATC ACCCAAGATA ATTTGAACTT GCGTGATGGC
CCTGGTACTG ACTACATCAG CATGAAAAAG CTGGGAATTG ATAGCCAAGT TTCATTGTTG
GCCCGCTATC AAGGCTGGTA TCAAATTGAA ACTGGTGAAG GTAATGTGGG TTGGGTTTCA
GCCGAATTCT TGAATCTTGA AGCTGGCGTT GCCGAACGGA TCGCCGAAGC TGAATCGATT
CCATCAGCTA ACCCCGATTT GGTCGGTTGG GCAACTGATG AAGGCATCAA CTTGCGCTCT
GGCCCAAGCA CCAAATTCGA TTCATTGGGC AAACTGAGCA AAGGCGCTGA ATTAACCTTA
TTAGCTCGTT ACAAAGAATG GGTCAAGGTT CAAACCGCCA AAGGTACCAA AGGCTGGATC
TCACAAGATT TAGTTGATGT CAGCAACTTT GTGTTCCGCC GTGTACCATT CACAACCAAT
GTGCCCTCAT TACCAGTTGC TCCAGCTGCC CCCAAAAAGA GCACTCCTAG CCAACCTGCT
GGTGGCGGTG GCGGCGGTGG TGGTACTGCT AGCGGCGACG TTGCTTCGAT GGCTTGGGCC
TATGTTGGCT ACAACTATCG CTGGGGCGGC GAAAGCCCAA GCAGCGGCTT CGATTGCAGC
GGCTTGACCA AGTATTTGTA TCGCCAAGTT GGGGTCAGCT TGCCCCACAG TGCCGCTGGC
CAATATAGCA GCGCTTATGG CACCTTCATC GGAAGCATGA GCAACTTGCA ACCAGGCGAT
TTGCTGTTCT ATGCTGGCAC TGCTGGCCCG GGCATCACCC ACGTAGGCAT CTACGTTGGC
GGTGGTGTGA TGGTCAATGC GATGACTCCC GCTTCGGGGG TTGGTGCAGT CAGCATCTAT
AGCAGCTACT GGCTCAATCA CTATTACGGC GCGTTGCGGC CTTATCGCTA G
 
Protein sequence
MQDAKSSNDL IQEDHDLQDF QQRAVNNENS LLRTTSLRQR VNARSVSALR AVPSYLRKAP 
RRYLLHLMVL SLLPVGLVVN KDATKPQVDT ALLVSASPTA ERQVRPALGL MTMTHRNEPA
PLTALNDSEA TPDPGVGDGP ISSPDFDDSL VIPVGRPVNN NPTYPESVVS ADIANLRNGP
STEFDRLDKL EPGTKVTVVA RHADWVQVRT EGGQEGWLAA DLLDLEQSVI DALPDAQNIP
TPPPAKVGKI TQDNLNLRDG PGTDYISMKK LGIDSQVSLL ARYQGWYQIE TGEGNVGWVS
AEFLNLEAGV AERIAEAESI PSANPDLVGW ATDEGINLRS GPSTKFDSLG KLSKGAELTL
LARYKEWVKV QTAKGTKGWI SQDLVDVSNF VFRRVPFTTN VPSLPVAPAA PKKSTPSQPA
GGGGGGGGTA SGDVASMAWA YVGYNYRWGG ESPSSGFDCS GLTKYLYRQV GVSLPHSAAG
QYSSAYGTFI GSMSNLQPGD LLFYAGTAGP GITHVGIYVG GGVMVNAMTP ASGVGAVSIY
SSYWLNHYYG ALRPYR